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NEUROTRANSMITTER TRANSPORTERS 
TECHNICAL FIELD 

This uxveaotiQa relates to uacleLC acid and amino add sequeaices of neurotransmitter transporters 
and to the use of these sequ^ces in the diagnosis, treatment, and prevention of transport, neurological 
and psychiatric disorders, and in tbe assessment of the effects of exogenous conq)ounds on tlie 
expression of nucleic add and ammo add sequences of neurotransmitter transporters. 

BACKGROUND OF THE INVENTION 
The human nervous system, whidi regulates an bodily functions, is conq)osed of the c^al 
nervous systmi (CNS), consisting of tlie brain and spinal cord, and the po-ipheral n^ous system 
(PNS), consisting of afferent neural pathways for conducting nerve impulses from sensory organs to the 
CNS. and efferent neural pathways for conducting motor impulses from the CNS to efiector organs. 
Tlie PNS can be further divided into the somatic nervous systan, which regulates voluntary motor 
activity sudi as for skeletal muscle, and the autonomic nervous system, whidi regulates involimtary 
motor activity for internal organs sudi as the heart, lungs, and viscera. 

The central nervous system (CNS) is composed of more than 100 billion neurons at the spinal 
cord level, the lower brain levd, and the higher brain or cortical level. Neurons transmit dectric or 
dsemical signals between cells. The spinal cord, a thin, tubular ext^sion of ^e central nervous 
system within Ihe bony spinal canal, contains ascending sensory and descending motor pathways, and 
is covered by membranes continuous with those of Ihe brainstem and cerebral hemispheres. The 
spinal cord contains almost the entire motor output and sensory input systems of ^ trunk and limbs, 
and neuronal circuits in the cord also control ihyflmiic movements, such as walking, and a variety of 
reflexes. The Iowa: areas of the brain sudi as the medulla, pons, mesencephalon, cerebellum, basal 
ganglia, substantia nigra, hypothalamus, and thalamus control unconscious activities including arterial 
pressure and respiration, equilibrium, and feeding reflexes, such as salivatioa Emotions, such as 
anger, excitemait, sexual response, and reaction to pain or pleasure, originate in the lower brain. The 
cerebral cortex or higher brain is the largest structure, consisting of a right and a left hemisphere 
interconnected by the corpus callosum. The cerebral cortex is involved in sensory, motor, and 
integrative functions related to perception, volxmtary musculoskdetal movements, and the broad 
range of activities associated with consciousness, language, emotions, and memory. The cerebrum 
functions in assodation with the lower centers of the nervous system, 

A nerve cdl (neuron) contains four regions, the cell body, axon, dendrites, and axon tenninal. 
Thecdlbody contains the nudeus and other organdies. The dendrites are processes whidi exteaid 
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outward from tbe cell body and receive signals from s^e organs or from the axons of other neurons. 
These signals are converted to dectrical impfulses and transmitted to the cdl body. The axon, whose 
size can range from one millimeter to more than one meter, is a singje process that conducts the nerve 
impulse away from the cdH body. Cytoskeletal fibers, including miaotubules and neurofflameats, run 
5 theliQigthof theaxonandfiuictionintranq)ortingprotd mionbrane vesicles, and other 

maaomolecules from the cdl body along the axon to the axon terminal. Some axons are surrounded by 
a mydin sheath made up of membranes from eitha: an oligodendrocyte cell (CNS) or a Schwann cell 
(PNS). Myelinated axons conduct dectrical impulses faster than unmydinated ones of the same 
diameter. The axon terminal is at the tip of the axon away from the cdl body. (SeeLodish. H. etaL 
10 (1986) Molecular Cdl Biolog y Sdojtific Ammcan Books New York NY, pp. 715-719.) 

CNS-assodated proteius have roles in neuronal signalmg, cdl adhesion, nerve regeneration, 
axon guidance, neurogaiesis. and other functions. Certain CNS-assodated proteins form an integral 
partof amanhraneorareattadiedtoamembrane. For example, neural membrane protdn 35 
(NMP35) is closdy assodated with neuronal membranes and is known to be highly expressed hi tb& rat 
15 adult nffvous systaa (Sdiwdtzer, B. et al. (1998) Mol Cdl. Neurosd. 1 1 :260-273.) Synaptcphysm 
(SY) is a major int^al membrane protdn of small synaptic vesides. The chromosomal location of S Y 
in human and mouse is on the X chromosome m subbands Xpl 1 .22-pl 1 .23 . This region has been 
m?)licated hi severd inherited diseases indudhig Wiskott-Aldrich syndrome, three forms of X-Hnked 
hypercalciuric nq)hrolithiaisis, and the eye disorders retmitis pigmentosa 2, congenital stationary night 
20 blindness, and Aland Island eye disease. (Fisher, S. E. et al. (1997) Genomics 45:340-347.) Peripherin 
or rdinal degeneration, slow protdn (rds) is an hitegral membrane glycqprotdn that is present in the 
runs ofphotarecq)tor outer segmmt disks. In mammals, rds is thought to stabilize the disk run through 
heterqphilic interactions with rdated nonglycosylated protdns. Rds is a mouse neurological mutation 
that is characterized by abnormal devdopmeot of rod and cone photoreceptors followed by their slow 
25 d^aieration. (Kedziarski, W.J. et al. (1999) Neurochem. 72:430-438.) 

43 KD postsynaptic protdn or acteyldioline receptor-associated 43 KD protein (RAPS YN) is 
thought to play a role hi anchormg or stabilizmg the nicotinic acetyldiolme receptor at synaptic sites. 
RAPS YN is involved in menabrane assodation and may link the nicotinic acetylcholine receptor to the 
underlymg postsynaptic cytoskdetoa (Buckd, A. et al. (1996) Gaiomics 35:613-616.) Neuritinisa 
30 protdn whose g^ is known to be induced by neural activity and by neurotrophins which promotes 
neuritogenesis. Neuraxin is a structural protdn of the rat caitral nervous system that is bdieved to be 
hmnunologically rdated to microtubule-assodated protean 5 (MAP5). Neuraxm is a novd type of 
nairon-spedflc protdn which is characterized by an unusual amino add composition, 12 central 
hq)tadecarq)eats and putative protdn and monbrane hiteraction sites. Hie gene encodmg neuraxm is 
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unique in th&li^loid rat g^ome and is conserved in higher vertebrates. Neuraxin is implicated in 
neuronal membrane-naicrotubule interactions and is expressed tfarougibout the rodent central nervous 
system (CNS). (Rienitz, A. al. (1989) EMBO J. 8:2879-2888.) 

NudCt a nuclear movement protein, interacts mth the lissmcqphaly product lisl, a 
5 protdn involved in neuronal migratioa People with Miner-Dieker syndrome (MDS) or isolated 

liss^cephaly sequence (ILS) have a hemizygous ddetion mutation in the LIS 1 gene. Both conditions 
are diaractedzed by a smooth cerebral surf ace» a thickened cortex with four abnormal layers, and 
misplaced neurons. LIS 1 is Mghly expressed m the ventricular zone and the cortical pla^ The 
interaction of Lisl with NudC» in conjunction with the MDS and ILS pheno^es, raises the possibility 

1 0 that nuclear mov^nent in the v^itricular zone is dosely related to neuronal fates and to cortical 
architecture. (Morris, S. M. et al, (1998) Curr. Biol. 8:603-606.) 

CNS-assdciatedprotehis can also bephosphoproteins. For exan?)le, ARPP-21 (cyclic 
AMP-regulated phosphqprotedn) is a cytosolic neuronal phosphoprotein that is highly enriched in the 
striatum and in other dopaminocqptive regions of the brain. The steady-state level of ARPP-21 mRNA 

15 is devdopm^xtally regulated. But, in the neonatal and mature animal, ARPP-21 mRNA is not altered 
foUowmg 6-hydroxydopamine lesions of the substantia nigra or by pharmacologic treatments that 
upr^ate the Dl- or D2-dqpamine receptors. (Ehrlidi, M. E. et al. (1991) Neurodiem. 57:1985- 
1991.) 

CNS-assodated signaling protdns may contain PDZ domains. PDZ domains have been found 
20 in proteins which act as adaptors in the assembly of multifunctional protein complexes involved in 
signaling events at surfaces of cell membranes. PDZ domains are generally found in membrane- 
assodated proteins including neuronal nitric oxide synthase (NOS) and several dystrophin-assodated 
protdns. (Pouting, C. P. et aL (1997) Bioessays 19:469-479.) 

CNS-associated protdns may also contain q)ida:mal growth factor (EGF) domams. The Notdi 
25 protdns are transmeshbrane protdns whidi contain extracdlular regions of repeated EGF domains. 
Notdi protdns, such as the Drosoohila mdanoeaster nioirogaiic protdn Notch, are generally hivolved 
in the inhibition of devdopm^ital processes. Other members of the Notdi family are the lin-12 and 
dp-l genes of Camorhabditis deeans . Genetic studies indicate that the lin-12 and g^l protdns act as 
receptors in spedfic devdopmental ceil interactions whidi may be involved in certain embyronic 
30 defects. (Tax, F. E. d al. (1994) Nature 368:150-154.)' Pecans a maternal-effect neurogenic locus 
of D. melanogastar is bdieved to encode a large transmembrane protein. In the absence of maternal 
e3cpression of the pecaneic gene, an embryo develops severe hypmiemialization similar to that 
characteristic of Notch mutant embryos, (LaBonne, S. G. et al. (1989) Dev. Biol. 136:1-1 16 .) Other 
CNS-assodated signaling proteins contain WW domains. The WW domam is a protdn motif with two 
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taigbly consesrved tryptophans. It is present in a number of signaling and regulatory proteins, including 
Huntingtin inta-acting protein. Several fibroblast growth factor (FGF) homologous factors (i e„ FHF 
polypeptides) have also been implicated in nervous system development based on mRNA ^pression 
pattems in mouse and human tissues. Members of tbe FHF famHy of polypeptides are stnic^ 
5 distinct from prototypicFOTs, consistent with the unusual role of these FGF-rdatedprotdns 
(SmaUwood, P.M. et aL (1996) Proc. Natl. Acad ScL U.S.A. 93:9850-9857 and Hartung, H. et al. 
(1997) Mech. Dev. 64:31-39). 

Alzheuno-'s disease (AD) is a dpgaio-ative disorder of the CNS winch causes progressive 
memory loss and cognitive decline during mid to late adult life. AD is characterized by a wide range of 

10 neuropathologic features including amyloid dqposits and intra-neuronal neurofibrillary tangles. 

Although the pathogenic pathway leading to neurodeg^ieration and AD is not well understood, at least 
three genetic lod that confer genetic susceptibility to the disease have been identified. (Schdlemberg, 
G.D. (1995) Proc. Natl. Acad. Sci. 92:8552-8559; Sherrington, R. et aL (1995) Nature 375:754-760.) 
Neuronal Thread Proteins (NTP) are a group of immunologically related molecules found in 

15 the brain and neuroectodmnal tumor cell lines. NTP expression is increased in neuronal cells during 
prolifCTation, differentiation, brain development, in Alzheimer's disease (AD) brains, and in 
pathological states associated with regenerative neare sprouting (dela Monte, S.M. et al. (1996) J. 
Neuropalhol. Exp. Neurol. 55:1038-1050). Monoclonal antibodies generated to a lecombhiant NTP, 
AD7C-NTP, isolated from an end-stage AD brahi library, showed high levels of NTP 

20 unmunoreactivity in poikarya, neuropil fibers, and white matter fibers of AD brain tissue, bi vitro 
studies also demonstrated NTP upregulation, phosphorylation, and translocation from the p^ikarya to 
cell processes and growth cones dming growth factor-induced neuitic sprouting and neuronal 
differentiation. Additionally, increased NTP immunoreactivity was found in Down syndrome brains 
beginning in the second decade, prior to establishment of widespread AD neurodegeneration, and at 

25 an age when a low-level or an absence of NTP expression was observed in control brains. These 
findings indicated that abnormal e^qpression and accumulation of NTP in brain may be an early 
marker of AD neurodegeneration in Dovm syndrome (de la Monte, S.M. et al. (1996) J, Neurol. Sd. 
135:118-125). Furthermore, the increased expression and accumulation of NTP in AD brain tissue 
was paralleled by corresponding elevations of NTP in cerebrospinal fluid (CSF). and elevated levels 

30 of NTP were detectable in the CSF early in the course of the disease. 

Astrocytomas, and the more malignant glioblastomas » are the most common primary tumors 
of tbe brain, accounting for over 65 % of primary brain tumors. These tumors arise in glial cells of the 
astrocyte lineage. Following infection by pathogens, astrocytes lunction as antigen-presenting cells and 
modulate the activity of lymphocytes and maaqphages. Astrocytomas constitutively express many 
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cytokuies and intaleukiiis that are nonnany prcxluced only after infection by a pathogen (de Micco, C. 
(1989) J. NeuroinmuuiDl. 25:93-108). ta the course of identifying genes related to astrocyte 
differentiation, one cDNA was isolated from an astrocytoma cDNA library that encodes a protein 
structurally related to the plant pathogenesis-related (PR) proteins (Muiphy, E. V. et al. (1995) Gene 

5 159:131-135). The glioma pathogenesis-related protein (GliPR) is highly eqiressedin glioblastoma, 
but not in fetal or adult brain, or in other nervous system tumors. PR protems are a family of small 
(10-20 kDa), protease resistant proteins induced la plants by viral infections, such as tobacco mosaic 
virus. The synthesis of PR proteins is believed to be part of a primitive immunological response in 
plants (van Loon, L.C. (1985) Plant MoL Biol. 4: 1 1 1-1 16). GliPR shares up to 50% homology with 

10 the PR-1 protein family over a region that comprises almost two thirds of the protein, including a 
conserved triad of amino acids, His-GIu-His, qyprqpriatdy spaced to form a metal-binding domain 
(Murphy et al., supra) . 

Fe65-like protein (Fe65L2), a new member of the Fe65 protein femily, is one of the ligands that 
interacts with the cytoplasmic domain of Alzheimer beta-amyloid precursor protein (APP). Transg^c 

15 mice expressing APP are known to simulate some of the promin^ behavioral and pathological features 
of Alzheimer's disease, including age-reLated impairment in learning and memory, neuronal loss, gliosis, 
nanitic changes, amyloid deposition, and abnormal tau phosphorylation. Proteins that interact with the 
cytqplasmic domain of APP provide new insights into the physiological function of APP fltyi^ in turn, 
into the pathogenesis of Alzhdmer's disease. (Duilio, A ^ al. (1998) Biochem. J. 330:513-519.) 

20 Contact from one neuron to auotha occurs at a specialized site called the syn2^ Atthissite, 

the axon terminal from one neuron (the presynaptic ceil) soids a signal to another neuron (the 
postsynaptic cell). Synapses niay be connected either electricafly or chemically. An dectrical synapse 
consists of gap junctions connecting the two neurons, allowing electrical impulses to pass directly from 
the presynaptic to the postsynaptic cell. In a chemical synapse^ the axon terminal of the presyn^c cell 

25 contains membrane vesicles contaming a particular neurotransmitter molecule. A change in dectrical 
potential at the nerve terminal results in the influx of calcium ions through voltage-gated rh^nnftig 
which triggers the release of the neurotransmitter from the synaptic vesicle by exocytosis. The 
neurotransmitter rapidly diffuses across the synaptic cleft separating the presynaptic nerve cell from 
the postsynaptic ceH. The neurotransmitter then binds receptors and opens transmitter-gated ion 

30 channels located in the plasma membrane of the postsynaptic cell, provoking a change in the cell' s 
electrical potential. This change in membrane potential of the postsyn^tic cdl may serve dtiier to 
excite or inhibit fiuther transmission of the n^e impulse. Presynaptic calcium channd activity is 
modulated by cystdne^string protdns (CSPs). CSPs are secretory vesicle proteins that function in 
neurotransmission as well as in exocytosis in other ceiQ-types. The effect of CSPs on calcium levds is 
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likely to be downstream of calcium rdease and is likely to im^olve exocytosis, possibly in connection 
with G-protdns (Magga, J.M. et al. (2000) Neuron 28:195-204; and Dawson-Scully, K. et al. (2000) J. 
Neurosd. 20:6039-6047; and Chamberlain, L.H. dt al. (2001) J. Ceil Sd. 114:445-455). 

Neurotransmitters comprise a diverse group of some 30 small molecules whidi include 
5 acetylcholine^ monoamines sudi as serotonin, dopamine, and histamine, and amino adds such as 
gamma-aminobutyric add (GABA), ghitamate, and aspartate, and neuropq)tides such as endoqphins 
and eikq)halms. (McCance, KL. and Huether, S.E. (1994) PATHOPHYSIOLOGY, Hie Biologic 
Basis for Disease in Adults and Children, 2nd edition, Mosby, St Louis, MO, pp 403-404.) Many of 
these molecules have more than one function and the eEBects may be excitatory, e. g. to ^polarize the 

10 postsym^tic cell plasma membrane and stimulate nerve iii5)ulse transmission, or inhibitory, e.g. to 
hyperpolarize the plasma mmbrane and inhibit nerve impulse transmissioa 

Neurotransmitters and their receptors are targets (tf pharmacological agents aimed at 
controlling neurological function. Fot example GABA is the major inhibitory neurotransnoitter in the 
CNS, and GABA receptors are the prindpal target of sedatives sudi as b6nzodiazq)mes and 

15 barbiturates whidi act by ehhanchig GABA-mediated effects (Katzung, B.G. (1995) Basic and Clinical 
Pharmacology . 6th edition, i^pleton & Lange, Norwalk, CT, pp. 338-339). 

Two major classes of neurotransmitta* transport's are essential to the function of the nervous 
syst^. The first class is uptake carriers in llie plasma membrane of neurons and glial ceils, which 
pvmp neurotransnoitters from the extracdlnlar space hito the cell. Ibis process relies on the Na* 

20 gradient across the plasma membrane, particularly the co-transport of Na*. Two families of proteins 
have been identified. One family indndes the transport's for GABA, monoamines such as 
noradrenaline, dopamine, serotonm, and amino acids such as glycine, and proline. Common 
structural components indude twelve putative transmeanbrane a-hdical domains, cytoplasmic N- and 
C- termini, and a large glycosylated extracellular loop separating transmembrane domains three and 

25 four. This family of homologous protems derives their en&cgy from the co-transport of Na* and CI" 
ions with the nemrotransmitter into the cdl (NaVCl* neurohransmitter transporters). The second family 
includes transporters for exdtatory amino adds such as glutamate. Common structural components 
include putative 6-10 transmembrane domains, cytoplasmic N- and C- termini, and glycosylations in 
the extracellular loops. The exdtatory amino add transporters are not dq)endent on Cl\ and may 

30 require intracellular ions (Na*/K*- neurotransmitt^ transporters) (Liu, Y. et al. (1999) Trends Cell 
BioL 9:356-363). 

The second class of neurotransmitter transport's is present in the vesicle membrane, and 
concentrates neurotransmittes from the cytoplasm into the vesicle, before exocytosis of the vesicular 
contents during synaptic transmission. Vesicular transport uses the electrochemical gradient across 
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the vesicular membrane generated by a iT-ATPase. Two families of proteins are involved in tbe 
transport of neurotransmitters into vesicles. One family uses primarily proton exdiange to drive 
transport into secretory vesicles and includes tbe transporters for monoamines and acetylcboline. For 
example, the monoamine transporters ^change two Ixuninal protons for each molecule of cytoplasmic 
5 transmito. Hie second family includes the GABA transporters, whidi relies on the positive charge 
inside synq)tic vesicles. The two classes of vesicular transporters show no sequence sunilarity to 
each other and have structures distmct from those of the plasma membrane carriers (ScMoss, P. et al. 
(1994) Cuir. Opin. Cell Biol. 6:595-599; Uu, Y. et al. (1999) Trends Cell Biol. 9:356-363). 
GABA is the predominant inhibitory neurotransmitter and is widely distributed in the 

10 mammalian nervous system GABA is cleared from the synq)tic cleft by q>ecific, high-affinity, Na*- 
and a*- dependent transporters, which are thought to be localized to both pre- and postsyn^c 
neurons, as well as to surroundhig glial cells. At least four GABA transporters (GATl -GAT4) have 
been cloned (Uu, Q.-R. et al. (1993) J. BioL Chem. 268:2106-2112). Studies of [^H]-GABAuptake 
into cultured cells and plasma-membrane vesicles isolated from various tissues revealed considerable 

15 differences in GABA transports hetoog^ity. GABA transporters exhibit differences in substrate 
affinity and sfpedficity, distinct blocker pharmacologies, and different tissue localization. For 
example, the K„ values of GABA uptake of the expressed GATl to GAT4 are 6, 79, 18, and 0.8 nM, 
req)ectively. In addition to transporting GABA, GAT2 also transports betaine; GAT3 and GAT4 also 
transport P-alanine and taurine. Pharmacological studies revealed that GABA transport by GATl and 

20 GAT4 is more sensitive to 2,4-diamm6butyric add and guavidne than that by GAT2 and GAT3. 
Situ hybridization showed that GATl and GAT4 e^qjression is brain specific. GAT2 and GAT3 
mRNAs were detected in tissues such as liver and kidney (Sdiloss, P. et al. (1994) Curr. Opin. Cell 
Biol. 6:595-599; Borden. L.A. (1996) Neurochem InL 29:335-356; Nelson, N. (1998) J. Neurochem. 
71:1785-1803). 

25 Human studies indicated that GABA transporter function is reduced in q)ilq)tic hq)pocampi. 

Decreased GABAergic neurotransmission has also been implicated in the pathophysiology of 

schizophrenia (Simpson, M.D, et al. (1992) Psychiatry Res. 42:273-282). 

Diazq)ain binding inhibitor (DBI), also known as endozepine and acyl-Coenzyme (CoA)- 

binding protdn, is an endogenous GABA receptor ligand which is thought to down-r^ate the effects 
30 of GABA. DBI binds medium- and long-chain acyl-CoA esters with very high affinity and may 

function as an intracellular carri^ of acyl-CoA esters (*125950 Diazepam Binding Inhibitor; DBI, 

Online Menddian Inheritance m Man (OMIM); PROSITE PDOC00686 Acyl-CoA-bmding protein 

signature). 

Glycine serves as one of the major inhibitory neurotransmitters in the mammalian nervous 
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system by activatiiig dHoride-channel receptors, which are members of a ligand-gated ion-channd 
supecfamily (Betz, H. (1990) Neuron 5:383-392). Glycine also facilitates excitatory transmission 
through an allosttt:ic activation of the N-methyl- D- aspartate (NMDA) receptor (Johnson, J. W. and P. 
Ascher (1987) Nature 325:529-531). Two fonns of glycine transporters have been described to date, 
5 GLYT 1 and GLYT 2. The two variants of GLYTl (GLYTl a/b) are generated by alternative 

spUcing (Lin, Q.-R. et al. (1993) J, Biol. Chem. 268:22802-22808) and differ in thek ammo-terminal 
sequences. The proteins have similar affinity for gilydne and both are sensitive to 
N-mettiyl-aminoacetic acid. In situ hybridization revealed that GLYTl a transcripts ware detected in a 
both neural and non-neural tissues, whereas GLYTlb was detected only in neural tissues (Borowslcy, 

10 B. et al. (1993) Neuron 10:851-863), High levels of GLYTla/b mRNA were found m hippocampus 
and cortex, suggesting a co-localization of this transporter with NMDA receptors and implying its 
involvement in the regulation of excitatory syn^c transmission. It is not clear whettio* GLYTla is 
ejqMressed in neurons, in gjia or in bottL In contrast, GLYTlb is found almost exclusively in fiber 
tracts, suggesting its locaUzation m gUal ceUs (Schloss, P. et al. (1994) Curr. Opin. Cell Biol. 

15 6:595-599). 

The second identified glycine transporter (GLYT2) diffCTS fi-om GLYTla/b by its extended 
intracellular amino tenminus. The predominant localization of its mRNA in brainstem and spinal cord 
and its insensitivity to N-mettiyl-aminoacetic acid suggests that GLYT2 terminates signal 
transduction at the strychnine-sensitive inhibitory glycine recq)tor. It has been proposed th^t, upon 

20 depolarization of cells harboring GLYTlb, tiie transporter runs backwards and releases glycine to act 
as a neuromodulatory amino add at the NMDA receptor (Attwell, D. and M. Bouvier (1992) Curr. 
Biol. 2:541-543). Such a Ca ^"^-independent, non- vesicular release of neurotransmitters by reverse 
transport was d^onstrated for glutamate and serotonin. This evidence suggests that the transmitter 
transporters may be important for both the initiation and tmnination of neurotransmitter action 

25 (Schloss, P. et al. (1994) Curr. Opin. CeU Biol. 6:595-599). 

The plasma membrane dopamine transporter (DAT) is essential for the reuptake of released 
dopamine from the synapse. Uptake of dopamine is temperature- and time-dependent, and is 
inhibited by a variety of compounds, such as cocaine. DAT- knockout mice have been shown to 
exhibit e3Ctreme hyperactivity and resistance to both cocaine and amphetamine, consistent with tiie 

30 primary action of cocaine on DAT (Giros, B. et al. (1996) Nature 379:606-612). The perturbation of 
the tightiy regulated DAT also predisposes neurons to damage by a variety of insults. Most notable is 
the selective degeneration of DAT-expressing dopamine nerve terminals in ttie striatum thought to 
underlie Parkinson's disease. DAT ejcpression can predict tiie selective vulnerability of neuronal 
populations, which suggests that tiierapeutic strategies aimed at altering DAT function could have 

35 significant benefits in a variety of disorders (Gary, W.M. et al. (1999) Trends Pharmacol. Sd. 
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20:424-429), 

Creatiiie transports are strongly related to transporters for GAB A. The primary sequence 
identity betwe^ creatine transporter species homologs is very high (98-99%). Pharmacological 
characterization demonstrated high affinity creatine uptake (27-43 jiM), which was blocked by 
5 creatine analogs with high affinity. Qreatine transporters are widely ejqjressed in a variety of 
mammalian tissues, including brain, adrenal gland, intestine, colon, prostate, thymus, ovary, spleen, 
pancreas, placenta, umbilical cord, thyroid, tongue, phamyx. vertebral discs, jaw, and nasal 
qpithdium. Genetic napping in the mouse localizes the creatme transporter to a re^^ 
chromosome in linkage conservation with the human region Xq28 , the location of the genes for 
10 several neuromuscular diseases (Nash, S.R. et al. (1994) Recq)tors Channels 2:165-174). 

Tbe substrates of a numba: of cDNA clones encoding proteins of the Na**" /Cl'-dependent 
transporter families are still not identified. These are orphan transporters. Identification of the 
substrates for orphan transporters has been difficult because in situ hybridization and 
immunohistochemistry indicate that the transporters are synthesized by phenotypically different 

15 neuronal populations, for example glutaminergic, GAB Aergic, histaminergic, or serotoninergic 
neurons. One of the transporters, NTT4 exhibits the Mghest homology to the creatine transport^^ It 
differs structurally from other members of this family in having an unusually long loop between 
transmembranes seven and eight (Liu. Q.-R. et al. (1993) FEBS Lett 315:114-118; Schloss, P. et al. 
(1994) Cuir. Opin. CeU Biol 6:595-599). 

20 Glutamate is a major excitatory neurotransmitter in the mammalian central nervous system. 

Electrogenic (Na* / K^-coupled glutamate transporters, located in the plasma membranes of nerve 
tOTninals and glial cells, mediate removal of glutamate released at excitatory synapses and maintain 
extracellular concentrations bdow neurototoxic levels. Glutamate transporters achieve this process 
by co-transport with three sodium ions and one proton, followed by translocation of a potassium ion 

25 in the opposite direction (Zerangue, N. and M.P. Kavanaugh (1996) Nature 383:634-637). 

Glutamate transporters belong to a large family of transport proteins. The membrane 
topology of the glutamate transporter reveals six membrane-spanning helices in the N-terminal part of 
the proteins (Slotboom, DJ. et al. (1999) Microbiol. Mol. Biol. Rev. 63:293-307). Hie C-tenninal 
half of the glutamate transporter is well conserved and constitutes a major part of the translocation 

30 pathway and contains the binding sites for the substrate and co-transported ions (Zhang. Y. and B.I. 
Kanner (1999) Proc. NaQ. Acad. Sci. USA 96:1710-1715). 

Impaired re-uptake of synaptic glutamate, and a reduced expression of glutamate transporters 
have been found in the motor cortex of patients with amyotrophic lat^al scl^osis (ALS). Inhibition 
of the synthesis of each glutamate transporter subtype using chronic antisense oligonucleotide 

35 administration, in vitro and in vivo, selectively and specifically reduced the protein expression and 

9 



wo 01/90148 



PCTAJSOl/16283 



function of glutamate transporters. The loss of glial glutamate transporters produced devated 
extracellnlar glutamate levels, neurodegeneration characteristic of excitotoxicity , and a progressive 
paralysis. The loss of the neuronal gilutamate transporter did not elevate extracellular glutamate in the 
striatum but produced mild neurotoxicity and resulted in epilepsy (Rothstein, J.D. et al. (1996) 
5 Neuron 16:675-686). 

The vesicular monoamine transporta:s (VMAT) package cytoplasmic monoamine 
neurotransmitters into secretory vesicles for regulated exocytotic rdease. VMAT acts as an 
dectrogenic exchange of protons and monoamines, using a proton dectrochemical gradient Two 
VMAT transportMS, VMATl and VMAT2 have been cloned from rat, bovine and human. The 

10 VMAT protdns possess twdve transmembrane segments, with both extremities lying on the 

cytoplasmic side. Examination of the subcellular localization of VMAT proteins indicates that they 
are associated with distinct vesicle populations in neurons and neuroendocrine cells. In rat, VMATl 
is ej^ressed in the adr^al gland whereas VMAT2 is expressed in the brain. In conhrast, the bovine 
adrenal gland expresses both VMATl and VMAT2 (Henry, J.-P. et al. (1994) J. Exp. Biol. 

15 196:251-262.). 

Vesicular transport is inhibited by the antihypertensive drug reseipine and the related but 
more centrally actmg drug tetrabenazine. The mechanism of transport and the biochemistry of 
VMAT have been analyzed with these drugs, using mainly the chromaffin granules from bovine 
adrenal glands as a source of transporters (Peter, D. et al. (1994) J. BioL Chem. 269:7231-7237). 

20 Human studies indicated that reseipine can cause a syndrome resembling depression, 

indicating the importance of vesicular transport activity for the control of mood and behavior. The 
psychostimulant amphetamine also disrupts the storage of amines in secretory vesicles^ further 
indicating that alt^adons in vesicular monoamine transport can affect behavior (Sulzer, D. and S. 
Rayport (1990) Neuron 5:797-808). 

25 Human diseases caused by defects in neurotransmitter transporters include schizophrenia, 

Tourette's syndrome, Parkinson's disease, brain ischemia, amyotrophic lateral scerlosis, depression, 
and epilq)sy. For example, decreased GAB Aergic neurotransmission has been implicated in the 
pathophysiology of CNS disorders such as epilq}sy and schizophrenia. Impaired re-uptake of 
synaptic glutamate, and a reduced expression of the glutamate transporter have been found in the 

30 motor cortex of patients with amyotrophic lateral sclerosis (ALS). The loss of glial glutamate 
transporters produces elevated extracellular glutamate levels, neurodegeneration characteristic of 
excitotoxicity, and a progressive paralysis. The loss of neuronal glutamate transporters produces mild 
neurotoxicity and result in epilepsy (Rothstem, J.D. et al. (1996) Neuron 16:675-686). 

Transporters for dopamine, norepinephrine, and serotonin have particular significance as 

35 targets for clinically relevant psychoactive agents including cocaine, antidepressants, and 
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amphetamines. Cocaine and antidepressants are transporter antagonists that act with varying degrees 
of specificity to enhance sym^c concentrations of amines by limiting clearance. Amphetamines 
enhance transporter mediated efflux in concert with a dq)letion of vesicular amine stores (Bark^, 
E.L. and R.D. Blakely (1995) Psychopharmacology 28:321-333; Sulzer, D. andS. Rayport (1990) 
5 Neuron 5:797-808; WaU, S.C. et al. (1995) Mol. Pharmacol. 47:544-550). 

Each of over a trillion neurons in adult humans connects with over a thousand target c^ 
(Tessier-Lavigne. M. et al. (1996) Science 274:1 123-1133). These neuronal connections form during 
embryonic development. Each differ^tiating neuron sends out an axon tq)ped at the leading edge by a 
growth cone. Aided by molecular guidance cues, the growth cone migrates thorough the anbryonic 

10 environment to its synaptic target Semaphorins are growth cone guidance signals that may ftmction 
during embryogoiesis by providing local signals to specify territories inaccessible to growing axons 
(Puschd, AW. et al. (1995) Nraron 14:941-948). 

The guidance of axons during devdqpmeot involves both positive and negative effects (Le. , 
cfacmoattraction and chemorepulsion). The Slit family of proteins have been in^plicated m promoting 

15 axon brandling, elongation* and rq)ulsioa Members of the Slit family have been id^itified in a variety 
of organisms, including insects, amphibians, birds, rodents and humans (Guthrie, S. (1999) Currait 
Biology 9:R432-R435). Slit protdns ^pear to be ligands for the repulsive guidance recqrtor, 
Roundabout (Robo); however. Slit also cause elongation in some assays. A post-translationally 
processed form of Slit appears to ttie active form of the protein (Guthrie, S . supra and Brose, K. etaL 

20 (1999) Cell 96:795-806). 

Axon growth is also guided in part by contact-mediated medianisms involving cell surface and 
extraceJlular matrix (ECM) molecules. Many ECM molecules, including fibronectin, vitronectin, 
meihbers of the laminin, tenascin, collageo, and thrombospondin families, and a variety of 
proteoglycans, can act dther as promota-s or inhibitors of neurite outgrowth and extension (Tessier- 

25 Lavigne et al.> supra) . Recqptors for ECM mciecules include integrins,inmiunoglobulinsuperfain3^ 
monbers, and proteoglycans. ECM molecules and their recqptors have also been inq)licated in the 
adhesion, maintoiance, and differ^tiatton of neurons (Rdchardt, L.F. et al, (1991) Ann. Rev. 
Nairosd. 14:531-571). The proteoglycan testican is localized to the post-synaptic area of pyramidal 
ceDs of the hippocampus and may play roles in recq)tor activity, neuromodulation, synaptic plasticity, 

30 and neurotransmission (Bonnet, F. et al. (1996) J. BioL Chsm. 271 :4373-4380). 

Other nervous system-associated protdns have roles in neuron signaling, ceJl adhesion, nerve 
regeneration, axon guidance, and neurogenesis. The neurexpphilins are neuropeptide-like protdns 
which are proteolytically processed after synthesis. They are ligands for the neuron-spedfic cell 
surface proteins, the a-neur&dns. Neurexpphilins and neurexins may partidpate in a neuron signaling 
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pathway (Misslo:, M. andT.C. Sudhof (1998) J. Nairosci. 18:3630-3638; Missler, M. et al. (1998) J. 
BioL Chem. 273:34716-34723). Ninjurin is a neuron cdl surface protdn whidi plays a role in ceill 
adhesion and in nerve regsierationfonowing injury. Ninjurin is up-r^gulatedaftea* nerve injury in 
dorsal root ganglion neurons and in Schwann cdls (*602062 Ninjurin; NINJl OMIM; Araki, T. and 
5 WBlhrandt, J. (1996) Neuron 17:353-361). Mammalian Numb is a phosphotyroshie^blnding (PTB) 
domain-containing protein wtiich may be involved in cortical neurogenesis and cdl fate decisions in the 
m a m m a lian nervous system. Numb's binding partner, the LNX protein, contains four PDZ domains 
and a ring finger domain and may participate in a signaling pathway involving Numb. PDZ domains 
have been found in proteins which act as ad^tors in the assembly of multifunctional protein 

10 complexes involved in signaling events at surfaces of cell membranes (Ponting, CP. (1997) 

Bioessays 19:469-479). LNX contains a tyrosine phosphorylation site which may be important fcr the 
binding of otha: PTB-containing proteins such as SHC, an adaptor protm which associates with 
tyrosine-phosphorylated growth factor recq)tars and downstream effectors (Dho, S.E. al. (1998) J. 
BioL ChmL 273:9179-9187). 

15 Another family of molecules thai appear to be hnportant for noirotransmission is the choline- 

transportor-like CTLl proteins. The prototypic CTLl was identified in yeast as a suppressor of a 
choline transport mutation; however, mammalian homdogues have been identified. The proteins 
conprise approximately ten putative transmembrane domains in addition to transporter-like motife but 
do not appear to be canonical choline tranq)orters. Choline transport is important to neurotransmission 

20 because choline is a precursor of ac^yldaoline* required in abundance by cholinergic neurons (O'R^an, 
S. et aL (2000) Proc. Nail. Acad Sci. U.S.A, 97:1835-40). 

The discovery of new neurotransmitter transporters and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevration, 
and treatment of transport, neurological and psychiatric disorders, and in the assessment of the effects 

25 of exogenous compounds on ttie expression of nucleic add and amino acid sequences of 
neurotransmitt^ transporters. 

SUMMARY OF THE INVENTION 
The invoition features purified polypeptides, neurotransmitter transporters, referred to 
30 collectivdy as "NTT' and individually as **NTT-1." "NTT-2," *'NTT-3," **NTT-4," "TNTT-S," and 
**NTT-6.". In one aspect, the hiv^on provides an isolated polypeptide selected firom the group 
consisting of a) a polypeptide comprising an amino acid sequence selected firom the group consisting of 
SEQ ID NO: 1-6, b) a naturally occurring polypeptide comprising an amino add sequence at least 90% 
identical to an amhio acid sequence sdected from the group consisting of SEQ ED NO:l-6, c) a 
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biologically active fragment of a polypeptide having an amino acid sequence sdected from the group 
consisting of S£Q ID NO: 1-6, and d) an immunogenic fragment of a polypqjtide having an amino add 
sequence sdected from the group consisting of SEQ ID NO: 1-6. In one alternative* the invention 
provides an isolated polypq)tide canqxrising the amino add sequence of SEQ ID N0:l-6. 

Hie inv^on further provides an isdated polynucleotide encoding a polypq)tide selected from 
the group ccmsistmg of a) a polypeptide oon;)rising an amhio add sequence sdected from the group 
consisting of SEQ ID NO: 1-6, b) a naturally occurring pQlypq)tide comprising an amino add sequence 
at least 90% identical to an amino add sequoice selected from the group consisting of SEQ ID N0:l-6, 
c) a biologically active fragment of a polypeptide having an amino add sequence sdected from Hie 
group consisting of SEQ ID NO: 1-6, and d) an immunogenic fragment of a polypeptide having an 
amino add sequence sdected from the group consisting of SEQ ID NO:l-6. la one alternative, the 
polynudeotide mxxies a polypq)tide sdected from the group consisting of SEQ ID N0:l-6. In another 
alternative, the polynucleotide is sdected from the ffoap consisttag of SEQ ID NO:7-12. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence qperably linked to a polynucleotide encoding a polypeptide sdected from the group consisting 
of a) a polypeptide comprising an ammo add sequaice sdected from the group consisting of SEQ ID 
N0:l-6, b) a naturally occurring polypeptide comprismg an amhio add sequoice at least 90% identical 
to an amino add sequMice sdected from the group consistuig of SEQ ID NO: 1-6, c) a biologically 
active fragment of a polypq)tide having an amhio add sequence selected from the group consisting of 
SEQ ID N0:l-6, and d) an immunogenic fragment of a pQlypq)tide havmg an amino acid sequence 
sdected from the group consisting of SEQ ID N0:l-6. In one altmiative, the invention provides a cdl 
transformed with the reconabinant polynucleotida In another alternative, the invention provides a 
transgenic c^ganism comprising the recombinant pdynucleotida 

The inv^tion also provides a mettKxl for producing a polypq)tide sdected from the group 
consisting of a) a polypq)tLde con^rising an amino add sequence sdected from the group consisting of 
SEQ ID NO: 1-6, b) a naturally occurring polypeptide con5)rising an amino add sequence at least 90% 
identical to an amino add sequ^ice sdected from tlie group consisting of SEQ ID N0:l-6, c) a 
biologically active fragment of a polypeptide having an amino add sequence sdected from the ffoap 
consisting of SEQ ID N0:l-6, and d) an immunogaiic fragment of a polypeptide having an amino add 
sequaace sdected from the group consistmg of SEQ ID NO: 1-6. The method conoprises a) culturing a 
cdl under conditions suitable for expression of the pdlypq)tide> v^^herein said cdl is transformed witti a 
recombinant polynudeotide comprising a promoter sequence q)erably linked to a polynudeotide 
encoding flie polypeptide, and b) recovering flie polypeptide so ejqjr^sed. 

Additionally, the invention provides an isolated antibody which spedfically binds to a 



13 



wo 01/90148 



PCTAJSOl/16283 



pdypq)tide selected firom the group consisting of a) a polypeptide compiisiiig an amino add sequence 
sdected from the group consisting of SEQ ID N0:l-6. b) a naturally occurring polypq)tide con5)rising 
an amino acid sequence at least 90% identical to an amino acid sequence sdected firom the group 
consisting of SEQ ID NO:l-6, c) a biologically active fragm^ of a polypeptide having an amino acid 
5 sequence selected from the group consistuig of SEQ ID N0:l-6, and d) an immunogenic fragment of a 
polypq)tide having an amino acid sequence sdected from the group consisting of SEQ ID NO: 1-6. 

The invention further provides an isolated polynucleotide sdected from the group consisting of 
a) a polynucleotide comprising a polynucleotide sequ&Dce sdected from the group consisting of SEQ ID 
NO:7-12, b) a naturally occurring polynucleotide comprising a polynucleotide sequence at least 90% 

10 id^cal to a polynudeotide sequoice sdected from the groiip consisting of SEQ ID NO:7-12, c) a 
polynudeotide conq)lem£ntary to the polynucleotide of a), a polynudeotide complonentary to the 
pdynudeotide of b), and e) an RNA equivalent of a)-d). In one alternative^ the pdymicleotide 
comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a m^od for detecting a target polynucleotide in a sample^ 

15 said target polynucleotide having a sequence of a polynucleotide sdected from the group consisting of 
a) a polynucleotide conq}rising a polynucleotide sequaice sdected from the group consisting of SEQ ID 
N0:7-12, b) a naturally occurring polynucleotide con^jrising a polynudeotide sequence at least 90% 
id^itical to a polynucleotide sequ^ce sdected from the group consisting of SEQ ID NO:7-12, c) a 
pdynudeotide con5)lCTaaatary to the polynucleotide of a), d) a polynucleotide complOTeotary to the 

20 pdynudeotide of b), and e) an RNA equival^ of a)-d). The metiuxl comprises a) hybridizing the 
sample with a probe comprising at least 20 contiguous nucleotides coDq)rising a sequaoce 
con5)lCTaentary to said target polynucleotide in the sample, and which probe specifically hybridizes to 
said target polynucleotide^ under conditions whereby a hybridization complex is formed between said 
probe and said target polynudeotide or fragments thereof, and b) detecting the pres^ice or absence of 

25 said hybridization complex, and optionally, if present, the amount thereof. In one alternative, tbe probe 
comprises at least 60 contiguous nudeotides. 

Hie invention frirthar provides a metiiod for detecting a target polynucleotide in a sample, said 
target polynucleotide having a sequence of a polynucleotide sdected from ttie group consisting of a) a 
polynucleotide comprising a polynudeotide sequence sdected from the group consisting of SEQ ID 

30 NO:7-12, b) a naturally occurring pdynudeotide can[^)rismg a polynucleotide sequm» at least 90% 
idsatical to a polynucleotide sequence sdected from the group consisting of SEQ ID NO:7-12, c) a 
polymicleoflde complanentary to the polynudeotide of a), d) a polynucleotide complementary to the 
pdymideotide of b), and e) an RNA equivalent of a)-<?). Hie metiiod conqprises a) amplifying said 
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) 
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detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and, 
optionally, if present, the amount thereof. 

Tbe invention faxtbec provides a conq)osition comprising an effective amomit of a polypq)tide 
selected from the group consisting of a) a polypeptide comprising an amino add sequence selected from 
5 the group consisting of SEQ ID N0:l-6, b) a naturally occurring pdypqptide comprising an amino add 
sequence at least 90% id^cal to an amino add sequence sdected from the group consisting of SEQ 
ID NO:l-(}, c) a biologically active fragment of a polypeptide having an amino add sequ^ce sdected 
from the group consisting of SEQ ID N0:l-6, and d) an immunogenic fragment of a polypeptide having 
an amino add sequaice sdected from the group consisting of SEQ ID N0:l-6, and a pharmaceuticaUy 

10 acceptable exdpient In one embodiment, the composition con^rises an amhio add sequence sd 

from the groi^p consisting of SEQ ID N0:l-6. The invention additianaUy provides a m^od of treatmg 
a disease or condition assodated vwth decreased esqpression of functional NTT, conpisuig 
administering to a pati^ in need of such treatment the composition 

The invention also provides a method for screening a compound for effectiveness as an 

15 agonistof a polypq)tide sdected from the groiqp consisting of a) a polypeptide coiqjrismg an amino 
add sequence sdected from the group consisting of SEQ ID N0:l-6, b) a naturally occurring 
polypqrtide comprising an amino add sequence at least 90% identical to an amino add sequence 
sdected from the group consisting of SEQ ID N0:l-6, c) a bidogicaUy active fragment of a 
polypq>tide having an amino acid sequence sdected from tbe group consisting of SEQ ID N0:l-6, and 

20 an imnmoganc fragment of a polypeptiidehavhig an ammo acid sequence sdect^ 

consisting of SEQ ID NO:l-6. The method conq>rises a) ejqwsing a sample comprising the 
polypq)tide to a compoxmd, and b) detecting agonist activity in the sample. In one altanative, the 
invMition provides a composition conprising an agonist compound identified by the metliod and a 
pharmaceutically acceptable excipient In anotiia: alternative the invention provides a method of 

25 treating a disease or condition associated with decreased ejcpression of functional NTT, comprising 
administering to a pati^t in need of sudi treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide sdected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consistmg of SEQ ID NO: 1-6, b) a naturally occurring 

30 polypeptide con5)risuig an ammo acid sequence at least 90% identical to an amhio acid sequence 
selected from the group consisting of SEQ ID N0:l-6, c) a biologicaQy active fragment of a 
polypq)tide having an ammo acid sequence sdected from the group consisting of SEQ ID NO: 1-6, and 
d) an immunogenic fragment of a polypqrtide having an ammo add sequojce sdected from the group 
consisting of SEQ ID NO:l-6. The method comprises a) exposmg a sample comprising the 
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polypq>tide to a compound, and b) detecting antagonist activity in the sample. In one alternative^ tbe 
invention provides a conq)osition comprising an antagonist compomid id^tified by the method and a 
pharmaceutically acc^table excipient In another alternative, the invention provides a method of 
treating a disease or condition associated with overexpression of functional NTT, comprising 
5 administ^g to a patient in need of sudi treatment the composition. 

The invention further provides a method of scre^oing for a confound that specifically binds 
to a polypeptide selected from the group consisting of a) a polypqitide conq}rising an amino add 
sequence sdected from the groiq) consisting of SEQ ID N0:l-6, b) a naturally occurrujg polypeptide 
cmoprising an amino acid sequence at least 90% idaitical to an amino add sequence selected from the 

10 group consistmg of SEQ ID N0:l-6. c) a biologically active fragm^ of a polypeptide having an amino 
add sequence sdected from the group consisting of SEQ ID N0:l-6, and cD an immunogenic fragment 
of a polypeptide having an amino add sequoice selected from the group consisting of SEQ ID NO;i -6. 
The method comprises a) combining the polypeptide with at least one test compound under suitable 
conditions, and b) detecting binding of the polypeptide to the test compound, thereby identifying a 

15 compound that specifically bmds to the polypq>tide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypq)tide sdected from the group consisting of a) a polypeptide comprising an amino 
add sequence sdected from the group consistmg of SEQ ID NO: 1-6, b) a naturally occurring 
polypq)tide con5)rising an ammo add sequaice at least 90% ideatical to an amino add sequence 

20 sdected from the group consisting of SEQ ID NO:l-6, c) a biologically active fragmait of a 

polypq)tide having an aidno add sequence sdected from the group consisting of SEQ ID NO: 1 -6, and 
(S) an immunogenic fragment of a polypq)tide having an amino add sequence sdected from the group 
consisting of SEQ ID NO:l-6. The method comprises a) combining the polypeptide with at least one 
test compound und^ conditions permissive for tiie activity of the polypq)tide, b) assessing the 

25 activity of the polypq)tide in the presence of the test conqwund, and c) comparing tiie activity of tiie 
polypeptide in the presence of the test compoxmd with tiie activity of tiie polypq)tide in the absence of 
the test compound, wherein a change in the activity of the polypeptide in the presence of the test 
compound is indicative of a compound that modulates the activity of the polypeptide. 

The invoition further provides a method for screening a compound for effectiveness in 

30 altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 

sequence selected from tiie group consisting of SEQ ID NO:7-12, tiie method comprising a) exposing 
a sample comprising the target polynucleotide to a conq)ound, and b) detecting altered e3q)ression of 
the target polynudeotide. 

The invention furih^ provides a metiiod for assessing toxicity of a test compound, said 
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method comprising a) treating a biological sample containing nucleic adds with the test compound; 
b) hybridizing the nudeic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynudeotide sdected fiom the groiq) consisting of i) a polynucleotide 
comprising a polynudeotide sequence selected from the group consisting of SEQ ID NO:7-12, ii) a 
5 naturally occurring polynucleotide comprising a polynudeotide sequence at least 90% identicjQ to a 
polynudeotide sequence sdected from the ffoap consisting of SEQ ID NO:7-12, iii) a polynucleotide 
having a sequeaoce complementary to i), iv) a polynudeotide con5)lementary to the polynucleotide of 
ii), and v) an RNA equivalent of i)-iv). Hybridization occurs undo: conditions wherd>y a specific 
hybridization complex is fanned between said probe and a target polynucleotide in the biologicsa 

10 sample, said target polynucleotide sdected from the group consisting of i) a polynucleotide 

comprising a polynudeotide sequence selected from the group consisting of SEQ ID NO:7- 12, ii) a 
naturally occurring polynucleotide comprising a polynudeotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID N0:7-l 2, iii) a polynucleotide 
conq)lementary to the polynucleotide of i), iv) a polynucleotide complementary to tbe polynucleotide 

15 of ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a 
fragment of a polynudeotide sequence selected from flie group consisting of i)- v) above; c) 
quantifying the amount of hybridization conq)lex; and d) comparing the amount of hybridization 
complex in the treated biological sample with the amount of hybridization complex in an untreated 
biological sample, wh^dn a difference in the amount of hybridization complex in tiie treated 

20 biological sample is indicative of toxicity of the test conqxjund. 

BRIEF DESCRIPTION OF THE TABLES 
Table 1 summarizes tiie nomenclature for tiie full lengOi polynudeotide and polypeptide 
sequ^ces of the present invaation. 
25 Table 2 shows the GenBank identification numba^ and annotation of the nearest GenBank 

homolog for polypeptides of the inventioa The probability score for the match between eadi 
polypq)tide and its GeaiBank homolog is also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including predicted 
motifs and domains, along witii the methods, algorithms, and searchable databases used for analysis of 
30 the polypeptides. 

Table 4 lists the cDNA and genomic DNA fragments whidi were used to assemble 
polynucleotide sequences of tiie invention, along witii sdected fragments of flie polynudeotide 
sequences. 

Table 5 shows the representative cDNA library for polynucleotides of the inventioa 
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Table 6 provides an appendix which describes the tissues and vectors used for construction of 
the cDNA libraries shown in Table 5. 

Table 7 shows the tocfls, progranos^ and algoriflmis used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

5 

DESCRIPTION OF THE INVENTION 

Before the present protdns, nucleotide sequ^ces, and mdhods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used hffein is for the purpose of d^ 

10 particular mbodim^ only, and is not intended to liniit the scope of the present hiventionwMch win 
be limited only by the app^ided claims. 

It must be noted that as used harein and m the iqjpeanyded claims, the singular forms "a," "an." 
and "the" uidude plural reference unless the context clearly dictates otherwise. Thus, for exan?)le, a 
reference to "a host cdl" includes a plurality of such host ceils, and a reference to "an antibody" is a 

15 reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

IMess defined otherwise, all technical and scientific terms used herein have the same mptflmfigQ 
as commonly understood by one of ordinary skQl in the art to whidi this invention belongs. Although 
any machhies, materials, and methods similar or equivalent to those described herein can be used to 

20 practice or test the present inventian, the preferred machines, materials and m^hods are now described. 
All publications mentioned herein are cited for the purpose of describing and disclosmg the cell lines, 
protocols, reagoits and vectors wMch are reported in the pubUcations and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is 
not eotitied to antedate such disclosure by virtue of prior inveutioa 

25 DEFINITIONS 

"NTT" refers to the amino acid sequences of substantially purified NTT obtained fi-om any 
species, particularly a mammalian species, including bovine, ovine, pordne, murine, equine, and 
human, and from any source, wheQier natural, syntti^c, semi-synthetic, or recombinant. 

The term "agonisf * refers to a molecule which intensifies or mimics the biological activity of 
30 NTT. Agonists may include proteins, nudeic adds, carbohydrates, small molecules, or any other • 
compound or composition which modulates the activity of NTT either by dSrecfly hateracting with NTT 
or by acting on components of the biological pathway in which NTT partidpates. 

An "aflelic variant" is an altmiative form of the gene encoding NTT. Afldic variants may 
result from at least one mutation in the nucldc add sequence and may result in altered mRNAs or in 
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polypeptides whose structure (x function may or may not be altered. A geoe may have none» one» or 
many alldic variants of its naturally occorring fona Common mutational dianges wMdi give rise to 
alldic variants are geasxBRy ascribed to natural deletions, additions, or substitutions of nucLeotides. 
Eachofthesetypesof changes may occur alone, or in combination with tteoth^ one or more times in 
5 a given sequence. 

"Altered" nucleic acid sequooces encoding NTT include those sequences with deletions, 
ins^ons, or substitutions of differoit nucleotides, resulting in a polypeptide the same as NTT or a 
polypeptide with at least one functional characteristic of NTT. bicluded within this d^mition are 
polymorphisms which may or may not be readily detectable nsing a particular oligonucleotide probe of 

10 the polynudeotide encoding NTT, and improper or unexpected hybridization to alldic variants, with a 
locus oth^ than the nonnalcbromosonial locus for the pdynucleotidesequ^K^een^ The 
encoded protein may also be "altered," and may contain deletions, ins^ons, or substitutions of amino 
add residues wMch produce a silent diange and resuU hi a fvuK^tionaUyeq^ Ddiberate 
amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 

15 hydrophobicity, hydrq}hilicity, and/or the amphipattiic nature of the residues, as long as the biological 
or immunological ac^vity of NTT is retained. For example, negatively charged amino acids may 
include aspartic add and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino adds with uncharged polar side chains having similar hydrophilicity values may 
indude: asparagine and glutamine; and s^ne and threonine. Amino acids with uncharged side diains 

20 having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The terms "amino add" and "amino add sequeace" refer to an oligopeptide, peptide, 
polypqitide, or protdn sequence, or a fragmoit of any of these, and to naturally occurring or synthetic 
mcflecules. Where "amino add sequence" is redted to refer to a sequence of a naturally occurring 

25 protdn molecule, "amhu> add sequence" and like terms are not meant to limit the amino add sequence 
to the ounplete native amino add s equence associated with the redted protdn molecule. 

"Amplification" rdates to the production of additional ccpks of a nucldc add sequence. 
Amplification is geoorally carried out using polymerase diain reaction (PGR) technologies wdl known 
in the art. 

30 The term "antagonist" refers to a m(flecule whidi inhibits or attaniates the biological activity of 

NTT. Antagonists may indude protdns sudi as antibodies, nuddc adds, carbohydrates, small 
molecules, or any other conqxnmd or con^sition whidi modulates the activity of NTT dther by 
direcfly interacting with NTT or by acting on con^poneDts of the biological pathway in which NTT 
partidpates. 
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The tenn "antibody'' refers to Intact iinmunoglobulin molecules as wdl as to fragments thereof, 
sudbi as Fab, F(ab')2> and Fv fragments, wbich are enable of binding an q)itppic detenninant 
Antibodies that bind NTT polypq)tides can be prepared using intact polypqjtides or using fragments 
containing small pq)tides of interest as ttie immunizing antigen. The polypeptide or oligopeptide used 
5 to immunize an animal (e.g. , a mouse, a rat, or a rabbit) can be derived from the translation of RN A, or 
synthesized cbemically, and can be conjugated to a carrier protein if desired. Commonly used carrim 
that are chemically coiq>led to peptides include bovine serum albumin, thyroglobuUn, and keyhole 
limpet li»nocyanin(KLH). The coupled pq}tide is then used to immunize the animal 

The term "antigenic determinant" refers to that r^on of a molecule (Le., an q}itQpe) that 

10 makses contact with a particular antibody. Whenaprotdnor afragmentof aprotdnisusedto 

immunize a host animal, numerous regions of the protdn may induce ttie production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dunensional structures on the 
protdn). An antigenic deteoninant may oonq)ete with Ihe intact antigen (Le., the immunogen used to 
elidt the immune response) for binding to an antibody. 

15 The term "antisaise" refers to any conipositiQn capable of base-pairing with the "sense" 

(coding) strand of a specific micldc add sequence. Antis^e compositions may indude DNA; RNA; 
peptide nuddc acid (PNA); oligonucleotides haviDg modified badcbone linkages sudi as 
phosphorothioates, methylphosphonates, or b^izylphosphonates; oligonucleotides having modified 
sugar groups sudi as 2 -methoxydliyl sugars or 2 -metfaoxyethoxy sugars; or oligonudeotides having 

20 modified bases sudi as 5-methyl cytoshie, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosinew Antis^e 
molecules may be produced by any nietfaodinduding chemical synthesis^ Once 
introduced into a ceJl, the conq)lmentary antisense molecule base-pairs with a naturally occurring 
nucldc add sequaice produced by the ceil to form duplexes whidi block dther transaription or 
transMioa The designation "n^ative" or ''minus" can refer to the antisaise strand, and the 

25 designation '^positive" or '*plus" can refa: to the sense strand of a rrferoice DNA molecule. 

The term '^biologically active" refers to a protdn having structural, regulatory, ot biochemical 
functions of a naturally occurring molecule. likewise^ "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic NTT, or of any oligopeptide thereof, to 
induce a specific immune response in appropriate anhnals or cdls and to bind with specific antibodies. 

30 "CcHuplonentary" describes the rdationship between two singjfr-stranded nuddc add 

sequences that anneal by base-pairing. For ©cample, 5'-AGT-3* pairs with its coniplement, 
3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "con?)osition conaprising a 
giv^ amino add sequence" refer broadly to any composition containiQg the given polynudeotide or 
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amino add sequence. Thecan^wsMonmaycon^xrisea*^ 

Coii5)Ositions comprisiDg polynucleotide sequences encoding NTT car fragmaits of NTT may be 
employed as hybridization probes. The probes may be stored in freez^dried farm and may be 
associated with a stabilizing ag^ sudti as a carbohydrate. In hybridizations, the probe may be 
dq)loyed in an aqueous solution containing salts (e.g., NaQ), digests (&g., sodium dodecyl sulfate; 
SDS)» and other components (e.g.» Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"ConssQSus sequence" refers to a nuddc add sequ^ice whidi has been subjected to repeated 
DNA sequMice analysis to resolve uncalled bases» extaided ushig the XL-PCR kit (Applied Biosystems, 
Fostear City C A) in the 5' and/ca: the 3' direction, and resequenced, or which has been assrabled from 
one or more overlapping cDNA. EST, or genomic DNA fragmraits usmg a con^juter program for 
fragment assembly, such as the GELVEW fragment assembly systan (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have beoa both extended and assembled to 
produce tbe consensus sequence. 

''Conservative amino add substitutions" are those substitutions that are predicted to least 
interfere with the propoties of the OTiginal protein, i. e. , the structure and especially the functian of the 
protdn is conserved and not significantly dianged by such substitutions. The table bdow shows amino 
adds whidi may be substituted for an cuiginal amino add in a protdn and whidi are regarded as 
conservative amino add substitutions. 



Original Residue Conservative Substitution 



Ala 


Gly, S&c 


Arg 


His, Lys 


Asn 


Asp, Gin, His 


Asp 


Asn, Glu 


Cys 


Ala, Ser 


Gin 


Asn, Glu, His 


Glu 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, Gin, Glu 


ne 


Lai, Val 


Leu 


ne,Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, lie 


Phe 


His, Met, Leu, Ttp, Tyr 


Ser 


Cys, Thr 


Thr 


Ser, Val 


Tip 


Phe, Tyr 


Tyr 


His, Phe, Trp 


Val 


De, Lai, Thr 



Consavative amino add substitutions gaierally maintain (a) the structure of the polypeptide 
backbone ip the area of the substitution, for example, as a b^ sheet or alpha hdical conformation. 
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(b) th& charge or hydrpphobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion" refers to a change in the amino add or nucleotide sequmce that results in the 
absoice of one or more amino acid residues or nucleotides. 
5 The tenn "derivative" refers to a chanically modified polynucl^ Chemical 

uKxlifications of a polynucleotide can include^ for eKample» replacemoot of hydrogen by an alkyl, acyl, 
hydroxyl, or amino groiq). A derivative polynucleotide encodes a polypeptide which retains at least one 
biological or immunological function of the natural molecule. A derivative polypeptide is one modified 
by jjycosylation, pegylation, or any similar process that retains at least one biological or immunological 
10 function of the polypq)tide firom which it was derived. 

A "detectable labd" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalentiy or noncovalentiy joined to a polynucleotide or pciypq>tide. 

"Dififearential expression" refers to increased or upregulated; or decreased, downregulated, or 
absent g^ or protdn repression, determined by conq)aring at least two different saii9)les. Such 
15 comparisons may be carried out between, for GKjmplGt a treated and an untreated saiq)Ie, or a diseased 
and a nomal sanq)le. 

A "fragment" is a unique portion of NTT or tiie polynucleotide encoding NTT which is 
idaitical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire lengtb of the defined sequence, minus one nucleotide/amino acid residue. For example, a 

20 fragment may comprise flrom 5 to 1000 contiguous nudeotides or amino add riesidues. A fragment 
used as a probe, primer, antigen, tha:^eutic molecule, or for other purposes, may be at least 5, 10, 
15, 16, 20, 25. 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nudeotides or amino acid 
residues in length. Fragmentsmaybepreferentially selected firom certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino adds selected 

25 from the first 250 or 500 amino adds (or first 25 % or 50%) of a polypeptide as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is suj^orted by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID N0:7- 12 comprises a region of unique polynudeotide sequence that 
30 specifically identifies SEQ ID NO:7-12, for example, as distinct fi:om any otha: sequence in the 
g^me from which the fragment was obtained, A fragment of SEQ ID NO:7-12 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that di.<8tingni.<ih 
SEQ ID NO:7~12 firom related polynucleotide sequences. The predse length of a fragment of SEQ ID 
NO:7-12 and tiie region of SEQ ID NO:7-12 to which the fragment corresponds are routindy 
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determinable by one of ordinary skill in the art based on the intended purpose for the fragment 

A fragment of SEQ ID N0:l-6 is encoded by a fragment of SEQ ID NO:7-12. A fragment of 
SEQ ID N0:l-6 comprises a region of unique amino acid sequence lliat specifically identifies SEQ ID 
NO; 1-6. For exanq)le, a fragment of SEQ ID N0:l-6 is useful as an immunogenic peptide for the 
5 development of antibodies that specifically recognize SEQ ID N0:l-6. Ihe precise Icaigth of a 

fragmait of SEQ ID NO: 1-6 and the region of SEQ ID NO:l-6 to which the fragment corresponds are 
routinely detenninable by one of ordinary skill in the art based on the int^ded puipose for the 
fragment. 

A '"ftiU length*' polynucleotide sequ»ce is one containing at least a translation initiation codon 
10 (e.g., methionine) followed by an open reading frame and a translation termination codon. A "ftdl 
length" polynucleotide sequence encodes a ''ftill length" polypeptide sequence. 

"Homology" refers to sequeice similarity or, interdhangeably, sequwice idratity, between two 
(X more polynucleotide sequences or two or more pQLypq>tide sequences. 

The terms **percait ideaofity" and "% identity," as applied to polynucleotide sequences, refer to 
15 the percentage of residue matches between at least two polynucleotide sequences aligned using a 

standardized algorithm. Such an algorithm may msert, in a standardized and rq)rodudble way, gaps in 
the sequoQces being conq)ared in order to qptunize alignment between two sequences , and therefore 
achieve a more meaningful conq)arison of the two sequsoces. 

Perc^ idaatity between polynucleotide sequ^ices may be determined using the default 
20 paramos of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e sequence 
jOignment program. This program is part of theLASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WT), CLUSTAL V is described in Higgins, D.G. 
andP.M. Sharp (1989) CABIOS 5:151-153 andhiHiggms, D.G. et aL (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
25 Ktople^2, gap p^altysS, windows=4. and "diagonals saved"=4. The "weighted" residue weight table is 
selected as the default Percentidoittty is rqxjrted by CLUSTAL V as the **pffl-caitsuM 
aligned polynucleotide sequaices. 

AltCTiatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Canter for Biotedmology Information (NCBI) Basic Local Alignment Search 
30 Tool (BLAST) (Altschul, S.R di aL (1990) J. MoL Biol 215:403-410), which is available from several 
sources, mdudlng the NCBI, Bethesda, MD, and on the Int^net at 

http://www.ncbihlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including 'Tjlastn," that is used to align a known pdynudeotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called **BLAST 2 



23 



wo 01/90148 



PCT/USO 1/16283 



Sequences" that is used for direct pairwise CQnq)anson of two nucleotide sequences. ''BLAST 2 
Sequaices" can be accessed and used interactively at http-7/www.ncbi.nlm.nih.gov/gorf1)12 Jitml. The 
**BLAST 2 Sequences" tool can be used for both blastn and blas^ (discussed bdow). BLAST 
programs are commonly used with gap and other paramet^s set to default settings. For exan[q)le, to 
5 compare two nudeottde sequaices, one may use blastn with the •*BLAST 2 Sequences" tool Version 
2.0. 12 (^pril-21-2000) set at default paramet^s. Such default parameters may be» for sample: 

Matrix: BLOSUM62 

Reward for nmtch: I 

Penalty for mismatdi: -2 
10 Open Gap: 5 and Extension Gap: 2 pemlties 

Gap X drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 

15 Porcaitid^tymaybemeasuredoverthelengthof an aitire defined sequQice* to as 

d^ned by a particular SEQ ID number^ or may be measured over a shorter length, for eKample^ over 
thel^igthof a fragment taken from a larger, defined sequence, to afragm^of atleast20, at 

least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 

20 shown ho-dn, in the tables, figures, or Sequaice Listing, may be used to describe a lengtti over which 
perc^itageid^ty may be measured 

Nucldc acid sequm:es that do not show a high degree of identity may neverthdess encode 
similar amino add sequmces due to the degeneracy ofthe genetic code. It is understood tiiatdianges in 
a nucldc add sequ^ce can be made using this d^eno-acy to produce multiple nucldc add sequences 

25 that all ^ode substantially the same protdn. 

The phrases '^percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matdies betwe^ at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence aUgnment are wdl-knowa Some alignmeait 
methods take into account conservative amino add substitutions. Sudi conservative substitutions, 

30 ejq)lahied in more detail above, generally preserve the diarge and.hydrophobidty at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity betwe^ polypeptide sequences may be d^mnined usmg the default parametars 
of tiie CLUSTAL V algoritiim as incorporated into tiie MEGALIGN vo-sion 3. 12e sequmce alignment 
program (described and refer^iced above). For pairwise alignments of polypq>tide sequences using 
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CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap peDalty^3, window=5, and 
"diagonals saved"=5. The PAM250nmtrix is sdected as liiedefaiat residue wdght As with 
poiynucleotide alignments, the percent id^ty is rq)arted by Ol^USTAL V as the ''percent similarity'* 
betweaa aligned polypeptide sequence pairs. 
5 Altemativdy the NCBI BLAST software suite may be used. For example, for a pairwise 

con5)arison of two polypeptide sequraoes, one may use the "BLAST 2 Sequoices" toed Version 2.0, 12 
(April-21-2000) with blastp set at defoult parameters. Such default parameters may be, for exan^le: 
Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
10 Gap X drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Perc^t id^ty may be measured ova: the length of an entire defined polypeptide sequence, for 

15 exaniple, as defined by a particular SEQ ID numbo:, or may be measured over a shorter length, for 
example, over the l^gth of a firagmait taltm from a larger, defined polypeptide sequ^ice, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Sudi lengths are exen^lary only, and it is understood that any fragment loigtli 
supported by the sequences shown herdn, in the tables, figures or Sequence Listing, may be used to 

20 describe a length over which percentage identity may be measured. 

"Human artificial diromosomes" (HACs) are linear microchromosomes wliich may contain 
DN A sequ^ices of about 6 tt> to 1 0 Mb in size and which contain all of the danents required for 
chromosome rq)lication, segregation and maint^iance. 

The tam •'humanized antibody** refors to an antibody molecule m which the amino add 

25 sequence in the non-antigen binding regions has be^ altered so that the antibody more dosdy 
resembles a human antibody, and still retains its original binding ability. 

' "Hybridization" rders to the process by wMdi a polynucleotide strand anneals with a 
con?)lemaQtary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nuddc acid sequMices share a high degree of complementarity. 

30 Specific hybridization complexes form under permissive annealing conditions and ranain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in detmntahig the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific bindhig. i.e., 
binding between pairs of nuddc add strands fliat are not perfectty matched Permissive conditions for 
an ne ali n g of nucleic acid sequences are routuidy d^erminable by one of ordinary skill in the art and 
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may be consistent among hybridization expedmi^, wliereas wash conditions may be varied among 
experiments to adueve the desired stringency, and tberefore hybridization specificity. Permissive 
annealing conditions occur, for ^cample, at 68T in the presm^e of about 6 x SSC, about 1% (w/v) 
SDS, and about 100 figM sheared, denatured salmon sperm DNA 
5 Graierally, stringency of hybridization is e9q}ressed, in part, with reference to the temperature 

umkr which the wash stq> is carried out. Sudi wash tenQ)sratures are typically selected to be about 
5T to 20°C lower than the thermal mdting pdnt (TJ for tibie specific sequoice at a defined ionic 
strength and pH. The T„ is the t0Q[q)^ature (under deGned ionic strength and pH) 
targ^ sequaice hybridizes to a perfectly matched probe. An equation for calculating T„ and conditions 

10 for nucleic add hybridization are wdl known and can be found in Sambrook, J. et al. (1989) Molecular 
aoDing: ALaboratorv Manual, ed., voL 1-3, Cold Spring Harbor Press, Plahivlew NY; spedficaUy 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present inv^Hon 
indude wash conditions of 68^*0 in the presraice of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 

15 Altamativdy. tempearatures of about 65°C, 60°C, 55°C, or 42^ may be used. SSC concenhration may 
be varied from about 0.1 to 2 x SSC, with SDS bdng present at about 0.1%. Typically, blocking 
reag^ are used to block non-specific hybridization. Sudi blocking reagents include, for instance^ 
sheared and d^tured salmon sperm DNA at about 100-200 ^g^. Organic solvent, sudi as 
fonnamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 

20 sudi as for RNA:DNA hyliridizations. Useful variations on these wash conditions will be readily 
appareait to those of ordinary skill in the art Hybridization, particularly under high stringency 
conditions, may be suggestive of evoJutionary similarity between the nudeotides. Sudi similarity is 
strongly indicative of a similar role for the nucleotides and thdr encoded polypeptides. 

Hie tmn "hybridization complex" refers to a con^plex formed between two nudeic add 

25 sequCTces by virtue of the formation of hydrogen bonds between con5)lenmtary bases. A hybridization 
con?)lex may be formed m solution (e.g., Cot or Rot analysis) or formed b^een one nucldc add 
sequence present in solution and another nuddc add sequ^ice unmobilized on a solid support (e.g., 
paper, membranes, fflters, diips, pins or gilass slides, or any other s^opriate substrate to wMdi cdls 
or thdr nucldc adds have been fixed). 

30 The words "hisotion" and "addition" refiesr to chaises in an amino add or nudeotide sequence 

resulting in the addition of one or more amino add residues or nudeotides, respectivdy. 

"Lmnune response" can refer to conditLons assodated with inflammation, trauma, immune 
disord^s, or infectious or genetic disease, etc. These conditicnis can be characterized by expression of 
various factors, e.g., cytokines, dionoldnes, and other signalbg molecules, whidi may affect cdlular 
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and systemic defense systems. 

An "inmmnogenic fragnooit" is a polypeptide or oligopeptide fragmmt of NTT which is 
capable of diciting an nnmune response when introduced into a living organism, for exan5)le, a 
mammal. The term "hnraunogaiic fragment" also includes any polypeptide or oligopq)tide fragment of 
5 NTT which is useful in any of the antibody production methods disclosed herein or known in the art. 

The tmn •'miaroarray" refiars to an arrangon^ of a plurality of polynucleotides, polypeptides, 
or other dbemical oon;)ounds on a substrate. 

The terms "demfflf * and "array eiematf '-refer to a polynucleotide, polypq)tide, or other 
chemical compound having a unique and defined position on a microarray . 
10 The tem**modulate** refers to a change in the activity of NTT For example, modulation may 

cause an increase or a decrease in protein activity, binding charactmstics, or any other biological, 
ftmctional, or bnmunological prop^es of NTT. 

The phrases "nucleic acid" and "nucldc add sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refo: to DNA or RNA of genomic or 
15 synflietic origin which may be single-stranded or double-stranded and may represent the saise or the 
antisense strand, to pqrtide nuddc add (PNA), or to any DNA-like or RNA-litoe mataial. 

"Operably linked" refers to the situation in whidi a first nucldc acid sequence is placed in a 
functional relationship with a second nudeic add sequence. For instance, a promoter is operably 
Imked to a coding sequence if the promoter aCffects the transcription or expression oftb& coding 
20 sequence, Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

*Tqjtide nucldc add" (PNA) refias to an antisme molecule or anti-gene agent whidi 
comprises an oligonucleotide of at least about 5 nudeotides in length linked to a peptide backbone of 
amino add residues ending in lysuie. The terminal lysme confers solubility to the con?)ositioa PNAs 
25 preferentially bind complementary single stranded DNA or RNA and stop transcript dongation, and 
may be pegylated to extend their lifespan in the cdL 

*Tost-translational modification" of an NTT may involve lq)idation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other UKxIifications known m the 
art These processes may occur syntheticany or biodiemically. Biochanicalmodifications will vary by 
30 cell type depending ontfaeOTzymatic milieu of NTT. 

"Probe" refers to nucleic acid sequences encoding NTT, their complements, or fragments 
thereof, which are used to detect identical, aUelic or rdated nucleic add sequences. Probes are 
isolated oHgonucleotides or polynucleotides attached to a detectable labd or reporter molecule. Typical 
labds include radioactive isotopes, ligands, diemiluminescent agents, and enzymes. **Prim£rs" are 
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short nucldc acids, usuafly DNA oligonudeolides, which may be annealed to a target polynucleotide by 
cain)lementary base-pairing. The primer may then be extotided along the target DNA strand by a DNA 
- polymerase eozyme. Primer pairs can be used for an9)Mcation (and identification) of a nucleic add 
sequence* e.g., by the polymerase chain reaction (PGR). 
5 Probes and primers as used in the presoit invention typically conprise at least IS contiguous 

nucleotides of a known sequence. In ordo: to enhance spedfldty, longer probes and primers may also 
be employed, such as probes and prime's that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least ISO consecutive nucleotides of the disdosed nucldc acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any lengOi supported by the 

10 specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for prq)aring and using probes and primers are described in the references, for 
exan?)le Sambrook, J. et aL (1989) Molecular Clodng: A Laboratca-y Manual, 2"^ ed., vol. 1-3, Cold 
Sitting Harbor Press, Hainview NY; Ausubd, F.M. et al. (1987) Current Protocols in Moleailar 
Biology . Gre^ Publ. Assoc. & Wiley-Intersdoices, New York NY; Innis, M. ^ al. (1990) PgR 

IS Protocds. A Guide to Met hods and Applications , Academic Press. San Diegn PA PCR prima* pairs 
can be derived from a known sequence^ for exanqple, by using computer programs int^ided for tbat 
purpose sudi as Prima: (Version 0.5, 1991, Whitdiead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are sdected using software known in the art for sudi 
20 purpose. For example, OLIGO 4.06 software is useful fc»: tte sdection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonudeotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 Mlobases. Similar primer sdection 
programs have incorporated additional features for e3q)anded c^abilities. For example^ the PrimOU 
primer sdection program (available to the public from the Gaiome Center at University of Texas South 
25 West Medical Centar, Dallas TX) is capable of dioosing specific primers from megabase sequeoces 
and is thus useful for designing primers on a g^iome-wide scope. The PrimerS primer sdection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Researdi, 
Cambridge MA) allows the user to input a "mispriming library," in whidi sequences to avoid as primer 
binding sites are user-specified. PrimerS is useful, hi particular, for the selection of oligonudeotides for 
30 miaoarrays. (The source code for the latter two prima: sdection programs may also be obtained from 
their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to Hie public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multi^ie sequence alignments, thereby allowing sdection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucldc add sequences. 
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Hence, this program is useful for identificatiaii of both unique and conserved oligonucleotides and 
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PGR or sequendng 
primes, microarray dements, or specific probes to identify fully or partially con?)lemfintary 
5 polynucleotides m a sample of nucleic adds. Methods of oligonudeotide selection are not limited to 
those described above. 

A 'Recombinant nucleic add" is a sequeaice that is not naturally occurring or has a sequence 

that is made by an artifi.dal combination of two or more otherwise separated segments of sequraice. 

This artifidal combination is o&ea accomplished by diemical synthesis or, more commonly, by the 
10 artificial manipulation of isolated segments of nucldc adds, e.g. , by gattetic enghieering tedmiques 

sudi as those described in Sambrook, supra . The term recombinant includes nucldc adds that have 

been altered soldy by addition, substitution, ot deletion of a portion of fiie nucldc add Frequaifly, a 

recombinant nuddc add may indude a nucldc add sequ^ice qperably linked to a promoter sequence. 

Sudi a recombinant nucldc add may be part of a vectOT that is used, for exanple, to transform a cdl. 
15 Altamtivdy, such recombinant nuddc adds may be part of a viral vector, e.g., based on a 

vacdnia virus, that could be use to vaccinate a mammal whadn the recombinant imcldc add is 

expressed, inducing a protective immunological response in the mannTifli , 

A 'Regulatory dement" refers to a nuddc add sequence usually derived from untranslated 

r^ons of a gene and includes enhancers, promoters, introns, and 5* and 3' untranslated regions (UTRs). 
20 R^ulatory dements interact with host or viral proteins whidi control transoiption, translation, or RNA 

stability. 

"Rq)Qrta: molecules" are diadcal or biodianical moieties used for labeling a nuddc add, 
amino add, or antibody. Reporter molecules include radionudides; aizymes; fluorescent, 
chemitaminescent, or chromogenic agaits; substrates; cofactors; inhibitors; magi^c particles; and 
25 other moieties known in the art. 

An "RNA equival^t," in referrace to a DNA sequence, is composed of the same linear 
sequaiceof nudeotides as IherdieraK^DNAsequaice with the excq)tion that all occurrences of the 
nitrogwious base thymine are replaced with uracil, and the sugar backbone is coiqposed of ribose 
instead of deoxyribose. 

30 The term "sample" is used in its broadest sense. A sample suspeded of containmg NTT, 

nucldc adds flooding NTT, or fr agmoits thereof may comprise a bodily fluid; an eidract from a cdl, 
diromosome, argandle, or membrane isolated from a cdl; a cdl; genomic DNA, RNA, or cDNA, in 
sdution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
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protdn or pq)tide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding conq)ositiQn. Tbe interaction is dep^idoat upon the presence of a particular structure 
oftheprotdn,e.g., the antig^cdetenniiiant or q>itQpe, recognize For 
exauoDple, if an antibody is specific for epitqpe "A," the preseoce of a polypq)tide comprising tbe epitope 
5 A, or thepresQice of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce the amount of labded A that binds to the antibody. 

Tbe tmn. "substantiany purified" refits to nucleic acid or amino add sequences that are 
removed from their natural enviroinment and are isolated or separated, and are at least 60% free, 
preferably at least 75 % free, and most prefiM-ably at least 90% free from other CQn?)onaits with which 
10 1h^ are naturally associated. 

A "substitution" rrfers to the rq)lacOTeiit of one or more amino acid residues or nucleotides by 
(Merent amino acid residues or nucleotides, respectively. 

"Substrate" refas to any suitable rigid or semi-rigid support includhig membranes, filters, 
chips, slides, waf(a*s, fibers, magnetic or nonmagnetic beads, gds, tubing, plates, polymers, 
15 miaqparticles and capillaries. The substrate can have a variety of surface forms, such as wdlls, 
trenches, pins, channels and pores, to whidi polynucleotides or polypeptides are bound 

A "transcript image" refors to the collective pattern of goie expression by a particular ceQ type 
or tissue under giv^ conditions at a giv^time 

"Transformation" describes a process by whidbi exogenous DNA is introduced into a recipient 
20 cdl. Transfcmnationnmy occur under natural or artifidal conditions accQ^ 

known in the art, and may rely on any known method for the ins ertion of fordgn nucleic acid sequences 
intoaprokaryoticoreukaryotichostcelL The metbod for transformation is sdected based on the type 
of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, 
decfroporation, heat shock, lipofection, and particle bombardment Ibe term *transf(ffmed cdls" 
25 includes stably transformed cdls in whidi the insated DNA is capable of rfi?)lication dtiier as an 

autonomously rq)licating plasmid or as part of tiie host djromosome, as weU as transiaifly transformed 
cells which express the inserted DNA or RNA for Ihnrted periods of time. 

A "transgenic organism," as used hearem, is any organism, including but not limited to 
animals and plants, in whidi one or more of the cells of the organism contains heterologous nucleic 
30 add introduced by way of human intervention, sudi as by transgenic techniques wdl known in the 
art Ttie nucleic acid is mtroduced into the cell, directty or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or m 
vitro fertilization, but rathear is durected to the introduction of a recombinant DNA molecule. The 
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transgenic organisms conten^lated in accordance with the present invention indnde bacteria, 
cyanobact^a, fungi, plants and animals. The isolated DNA of Ihe present invention can be 
introduced into the liost by methods known in the art, for exanq)le infection, transfection, 
transformation or transconjugation. Techniques for transf oiing the DNA of the present invention 
5 into such organisms are widely known and provided in rtferences such as Sambrook et al. (1989), 
supra . 

A 'Varianf ' of a particular nudeic add sequence is defined as a nuddc add sequence having at 
least 40% sequence ide^itity to the particular nucldc add sequence over a certain length of one of tbe 
nucldc acid sequences using blastn with the "BLAST 2 Sequaices" tool Version 2.0.9 (May-07-1999) 

10 sd at default paramet^s. Such a pair of nucldc adds may show, for exanple, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%. at least 90%, at least 91%, at least 92%, at least 93%. at 
least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
idaitity over a certain defined laigth. A variant may be described as, fcnr example, an "alldic" (as 
defined above), "splice." "spedes," or **polymarphic" variant. A splice variant may have significant 

15 id^tity to a reference molecule, but will generally have a greater (x lesser number of polynudeotides 
due to altmiative splidng of exons during mRNA processing. The correspondmg polypeptide may 
possess additional functional domaiins or lack domains that are present in tbe reference molecule. 
Spedes variants are polynucleotide sequences that vary from one spedes to another. The resulting 
polypeptides will g^ierally have significant amino add idaitity relative to eadi other. A polymorphic 

20 variant is a variation in tiie polynucleotide sequence of a particular gene between individuals of a given 
spedes. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
whidi the polynudeotide sequaice varies by one nucleotide base. The presence of SNPs may be 
indicative of, for exanqple^ a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequ^ce is defined as a polypeptide sequence having at 

25 least 40% sequenceideiitity to the particular polypeptide sequence over a certain laigtti of one 

polypeptide sequences using blastp with tiie "BLAST 2 Sequences" tool Version 2.0.9 CMay-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for exanq)le, at least 50%, at least 
60%, at least 70%. at least 80%, at least 90%. atleast91%, at least 92%, at least 93%, at least 94%, at 
least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence identity over a 

30 certain defined length of one of the polypq)tides. 

THE INVENTION 

The invention is based on the discovery of new human neurotransmitter transporters (NTT), the 
polynucleotides encoding NTT, and flie use of these compositions for the diagnosis, treatmait, or 
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prevention of transport; neurological and psychiatric disorders. 

Table 1 summarizes tbe nomenclature for the full length polynucleotide and polypqytide 
sequoices of the invention. Each polynucleotide and its corresponding polypqptide are correilated to a 
single Incyte project idoitification nuinber (Incyte Project ID). Each polypeptide sequence is denoted 
5 by.boHi a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incyte 
polypq)tide sequence number (Incyte Polypeptide ID) as sho^ Each polynucleotide sequence is 
d^ted by both a polynucleotide sequsoce identification number polynucleotide SEQ ID NO:) and an 
Incyte polynucleotide cons^isus sequence number (Incyte Polynucleotide ID) as shown. 

Table 2 shows sequences witii homology to the pdypqrtides of the invention as identified by 

10 BLAST analysis against the GoiBank protein (genpept) database. Columns 1 and 2 show the 

polypq)tide sequence identification numbo: (Polypeptide SEQ ID NO:) and the corresponding Incyte 
polypq>tide sequMice number (fiicyte Polypeptide ro) for polypqptito Column 3 

shows the GenBank identification number (Grabank ID NO:) of the nearest GenBank homolog. 
Column 4 shows Ibe probability score for the match between each polypq)tide and its GenBank 

15 homolog. Cohunn 5 shows the annotation of tbe GenBank homolog along with relevant citations where 
applicable, all of whidh are expressly incorpOTated by reference herein 

Table 3 shows various structural features of Itoe polypeptides of the invention Columns 1 and 2 
show the polypeptide sequaice identification numbo: (SEQ ID NO:) and the corresponding Incyte 
polypqmde sequence number (Incyte Polypqmde ID) for eadipoly^ Columns 

20 shows the number of amino add residues in each polypeptide. Column 4 shows potential 

phosphorylation sites, and jx)hmm 5 shows potential glycosylation sites, as determined by the MOTIFS 
program of the GCG sequence analysis software package (Graetics Con?)uter Group, Madison WI). 
Column 6 shows amino add residues comprising signature sequaices, domains, and motifis. Column 7 
shows analytical methods for protdn structure/function analysis and in some cases, searchable 

25 databases to which the analytical methods were applied. 

Tog^QT, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are neurotransmitter transporters. For exau5)le, SEQ 
ID N0:4 is 95 % identical to human taurine transporter (GenBank ID g799339) as ddenmned by the 
Basic Local Alignment Seardi Tool (BLAST). (See Table 2.) The BLAST probabiUty score is 0.0. 

30 which indicates the probability of obtaining the observed polypeptide sequence alignment by chance. 
SEQ ID N0:4 also contains a sodium:neurotransmitta: symporter family (SNF) signature as determined 
by seardiing for statistically significant matches in tiie hidden Markov model (HMM)-based PFAM 
database of conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and 
PROFELESC AN analyses provide ftirther corroborative evidaice that SEQ ID N0:4 is an SNF 
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neurotansmitter trassporter. In an sOtemate exanple. SEQ ID N0:5 is 94% id^cal to human 
dopamine transporter (GenBank ID gl816S6) as detennined by tbe Basic Lxx;al Aligrnn^ Search Tool 
(BLAST). (See Table 2.) The BLAST probability score is 0.0, which indicates the pn^ability of 
obtaining tiie observed polypeptide sequm^e aligran^ by chance. SEQ ID N0:5 also contains a 
5 sodium necir<^anspQrta: synq)orter family domain as determined by searching for statisticany 

significant matdbes in the hiddm Markov modd (HMM)-based PFAM database of conserved protein 
family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide 
further corroborative evidsoce that SEQ ID N0:5 is a dopamine transporter. In an alternate exan^le. 
SEQ ID NO:6 is 90% identical to rat neurotransmitter transporter rB21a (GenBank ID g914028; 

10 Smith, K.E. et aL (1995) FEBS L^. 357:86-92) as detennhied by the Basic Local Alignment Search 
Tool (BLAST). (See Table 2.) The BLAST probability score is 5.7e-300, vMch indicates the 
probability of obtaining the observed polypq>tide secpience alignment by chance. SEQ ID NO:6 also 
contains sodrumuKwrotransnutter symporter family domains as d^ermined by searching for statistically 
significant matches in the hidden Markov model (HMNQ-based PFAM database of consarved protein 

15 family domams. (See Table 3.) Data from BLIMPS, PROFILESCAN, and BLAST analyses provide 
further corroborative evidence that SEQ ID NO:6 is a sodiumrneurotransmitter syn^XHler. SEQ ID 
N0:1, SEQ ED N0:2, and SEQ ID N0:3 wore analyzed and annotated in a similar manner. The 
algorithms and parameters for the analysis of SEQ ID NO: 1-6 are described in Table 7. 

As shown in Table 4, the fun length polynucleotide sequoices of the present invention were 

20 assembled using cDNA sequences or coding (exon) sequaices derived from genomic DNA, or any 
combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence 
identification numiber (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
consensus sequence nmnber (Incyte Polynucleotide ID) for each polynucleotide of the invention. 
ColumnS^owstiielengthof each polynucleotide sequence in basq)airs. Column 4 lists fragments of 

25 the polynucleotide sequences which are useful, for wcample, in hybridization or amplification 

technologies tiiat identify SEQ ID NO:7-12 or that distinguish between SEQ ID NO:7-12 and related 
polynucleotide sequences. Ccdumn 5 shows idaitification zuunbers corresponding to cDNA sequences, 
coding sequences (exons) predicted from gosomic DNA. and/or sequence asseniblages comprised of 
both cDNA and genomic DNA. These sequences were used to assanble the full length polynucleotide 

30 sequfflices of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5') and stop (3') 
positions of the cDNA and genomic sequaices in column 5 relative to thear respective fidl length 
sequences. 

The identification numbas in Column 5 of Table 4 may refa: specifically, for exan?)le, to 
Incyte cDNAs along with their aaresponding cDNA libraries. For example^ 
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5407696F6 is the identification number of an Incyte cDNA sequ«K», and BRAMNOTOl is the cDNA 
library from which it is derived. Incyte cDNAs fiar which cDNA libraries are not indinated were 
derived from pooled cDNA libraries (e.g., 70501086V1). Atonativdy, the identification numbers in 
cdmnn 5 may refer to GenBank cDNAs or ESTs (e.g. , gl748013) which contributed to the assembly of 
5 the full iQigth polynucleotide sequm:es. Altenmtivdy, the identification numbers in cohunnS may 
r^er to coding regions predicted by Genscan analysis of genomic DNA The Genscan-predicted coding 
sequences may have been edited prior to assmbly. (SeeExan^lelV.) Alternatively, the identification 
numbers in column 5 may refer to assemblages of both cDNA and Genscan-predicted exons brought 
together by an "exon stitching" algorithm. (See Example V.) Alternatively, the idaitification numbm 
10 in column 5 may refer to assemblages of both cDNA and Genscan-predicted exons brougjit together by 
an "exon-stretching" algorithm. For exanq)le, FL7472800_g7108462_gl81656 is the identification 
number of a "stretched" sequence, with 7472800 being the Incyte project identification number, 
g7108462 being the GenBank identification number of the human genomic sequence to which the 
"exon-stretdiing" algorithm was applied, and gl81656 being the GenBank identification number of the 
15 nearest GenBank protdnhomolog. (See Example V.) In some cases, Incyte cDNA coverage redundant 
with the sequence coverage shown in column 5 was obtained to confirm the final consensus 
polynucleotide sequence, but the relevant Incyte cDNA idaitification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those fuU length polynucleotide sequences 
which were assembled using Incyte cDNA sequaices. The representative cDNA library is the Incyte 
20 cDNA library whidi is most frequoifly rq)resented by the Incyte cDNA sequences vMch were used to 
assemble and confirm the above polynucleotide sequences. The tissues and vectors which were used to 
construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses NTT variants. A preferred NTT variant is one which has at 
least about 80%, or altemativdy at least about 90%, or even at least about 95% amino acid sequence 
25 identity to the NTT amino add sequence, and wMdi contains at least oiiefimctional or structural 
characteristic of NTT. 

The invention also encon5)assespolynuciectideswMchaKX^ In a particular 
enabodiment, the invention ^con^asses a polynucleotide sequence comprising a sequence selected from 
tiie ffcmp consisting of SEQ ID NO:7-12, Mdiich encodes NTT. The polynucleotide sequences of SEQ 
30 ID NO:7-12, as presented m tiie SequMice Listing, embrace tiie equival^t RNA sequences, wherein 
occurreaces of the nitrogaious base thymme are rq)laced with uracil, and tiie sugar baddjone is 
conqx)sed of ribose instead of deoxyribose. 

The inv^on also enoon?)asses a variant of a polynucleotide sequence encoding NTT. In 
particular, such a variant polynucleotide sequence will have at least about 70%, or altemativdy at least 
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about 85%, or epvea at least about 95% polynucleotide sequence idoitity to the polynucleotide sequence 
^coding NTT, A particular aspect of the inveotion eiicon$)asses a variant of a polynucleotide 
sequ^e comprising a sequence selected from Oe grcFi^ consisting of SEQ ID NO:7-12 which has at 
least about 70%, or altemativdy at least about 85%, or even at least about 95% polynucleotide 
5 sequence identity to a nucleic add sequence selected from the group consisting of SEQ ID NO:7-12. 
Any one of tlie polynucleotide variants desonbed above can encode an amino acid sequence which 
contains atleastonefiuK^onalorstnicturalcharactensticof NTT. 

It wai be ^preci ated by those skilled in the art that as a result of the degeneracy of tlie genetic 
code, a multitude of polynucleotide sequences aicoding NTT, some bearing mhihnal similarity to the 

10 polynucleotide sequences of any laiown and naturally occurring gene, may be produced. Thus, the 
invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting coinbinations based on possible codon choices. These CQmbinations are made m 
accordance with the standard triplet genetic code as implied to the polynucleotide sequence of naturally 
occurring NTT, and all sudi variations are to be considered as being specifically disclosed. 

15 Although nucleotide sequences which aioode NTT and its variants are gaierally capable of 

hybridizing to the nucleotide sequaice of the naturally occurring NTT under appropriately selected 
conditions of shingency, it may be advantageous to produce nucleotide sequences encodhig NTT or its 
derivatives possessing a substantially differ^ codon usage» e.g., inclusion of non-naturally occurring 
codons. Codons may be selected to increase the rate at which expression of the pq)tide occurs in a 

20 particular prokaryotic or eukaryotic host in accordance with the frequaicy with which particular codons 
are utilized by the host. Other reasons for substantially altmng Hie nucleotide sequence encoding NTT 
and its derivatives Mthout altering the encoded amino acid sequences include the production of RN A 
transcripts having more desirable properties, such as a greater half-life, than transcripts produced from 
the naturally occurring sequence. 

25 The invaition also encon:?)asses production of DNA sequences which encode NTT and NTT 

dffivatives, or fragmiaits tbo-eof, entirdy by synthetic chanistry. After production, the synthetic 
sequence may be inserted into any of the many available expression vectors and cell systems using 
reagents wdl known in the art. Moreover, synthetic chemistry may be used to introduce mutations into 
a sequence encoding NTT or any fragment ttiereof . 

30 Also encon5)assed by the invention are polynucleotide sequences that are capable of 

hybridizmg to the claimed polynucleotide sequaices, and, m particular, to those shown in SEQ ID 
NO:7-12 andfragments thereof undervarious conditions of stringmcy. (See, e.g., Wahl, G.M. and 
S.L. Bergo: (1987) Methods Enzymol. 152:399-407; Kunmd, A.R. (1987) Methods Enzymol. 
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152:507-511.) Hybridizatioa conditions^ including annealing and wash conditions* are desoibed in 
"Definitions." 

Mdliods for DNA sequencing are well known in the art and may be used to practice any of tbe 
embodiments of the invention. Tlie niettiodsnmyeii;}loy such enzymes as the Klenowfragm^ 
5 DNApdymfirase I, SEQUENASE (US Biodiemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proo&eadiAg exonucleases such as those found in the ELONGASE 
anq)Mcation system (Life Technologies, Gaitfaersburg MD). Pr^erably, sequence preparation is 
automated with machines such as the MCROLAB 2200 liquid transfer system (HamiltQn, Reno NV), 

10 PTC200 thranal cycla: (MJ Research. Watertown MA) and ABI CATALYST 800 thmnal cyder 

(Applied Biosystems). Sequaicing is thai carried out using dthex the ABI 373 or 377 DNA sequencing 
systan (Applied Biosystems), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, 
Sunnyvale CA), or other syst^ns known in the art The resulting sequeiK^ are analyzed using a 
variety of algorithms which are well known in the art (See, e.g., Ausubd, F.M. (1997) Short Protocols 

15 in Molecular Biology, John Wfley & Sons. New York NY, unit 7.7; Meyers, R. A. (1995) Molecular 
Biologv and Biotechnology . Wiley VCH, New York NY, pp. 856-853.) 

Tbe nacldc add sequences encoding NTT may be ex:tended utilizing a partial nucleotide 
sequm» and employing various PCR-based methods known in the art to detect upstream sequences, 
sudi as promoters and regulatory dem^its. For exanq)le, one method which may be eiq)loyed, 

20 resttiction-site PCR, uses universal and nested primers to amplify unknown sequence from g^mic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR M^hods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that ejrtrad in divergent directions to amplify unknown 
sequttice from a drcularized totqplate. The tonplate is derived from restriction fr agmrats comprishig a 
known gCTomic locus and surrounding sequences. (See, e.g., Triglia. T. et al. (1988) Nucleic Adds 

25 Res. 16:8186.) Athirdmetfaod, capture PCR, involves PCR an5)lification of DNA fragments adjacent 
to kmjwn sequ^ces m human and yeast ardfidal chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-119.) In this mdfaod,multq)le restriction enzyme digestions and 
ligations may be used to insert an oigineered double-stranded sequence into a r^on of unknown 
sequ«ice before performing PCR. Other methods which may be used to retrieve unknown sequences 

30 are known in the art. (See, e.g., Parker, J.D, et al. (1991) Nudeic Adds Res. 19;3055-3060). 

Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontedi, Palo 
Alto CA) to walk gaiomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software^ sudi as OLIGO 4.06 prnner analysis software (National Biosdences, 



36 



wo 01/90148 



PCT/USOl/16283 



Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in laigth, to have a 
GCcoDtentof about 50% or more, andtoannealtotheten5)lateattanpa:atures of about68°Cto 
72T. 

When screenmg for full loigth cDNAs, it is preferable to use libraries that have been 
5 size-sdected to include larga: cDNAs. In addition, randomrprimed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in wMch an oligo d(T) library 
doesnotyiddafijll-lengtlicDNA. Genomic libraries may be useful ficr extension of secpenc^ 
non-transcribed regulatory r^ons. 

CapiQary dectrqphoresis systems whidi are commercially available may be used to analyze the 

10 size or confirm the nudeotide sequMce of sequencing or PCR products. In particular, capillary 
sequencing may easploy flowable polymers for dectrophoretic separation, four different nudeotide- 
specific, laser-stimulated fluorescent dyes, and a diarge coupled device camera for detection of the 
emitted wavd^gths. Output/light intoisity may be convoled to dectrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystms), and the entire 

15 process from loading of sanples to coDHputer analysis and dectronic data display may be con5}uter 
controlled. Capillary dectrophoresis is especially prefa^able for sequendng small DNA firagmoots 
vMch may be present in limited amounts in a particular s an^le. 

In another embodnnent of the invention, polynudeotide sequences or fragments thereof whidi 
emx)de NTT nmy be cloned in recombinant DNA molecules tiiat direct ex^^ or 

20 fragm^ or functional equivalents thereof, in appropriate host cdls. Due to the inherent deg^ieracy of 
the gaietic code, other DNA sequojces wliidi encode substantially the same or a functionally equivalent 
amino add sequence may be produced and used to ^ress NTT. 

The nucleoticte sequences of the present inv^on can be engmeared using methods generally 
known in the art in order to alter NTT-^icoding sequm^es for a variety of purposes induding, but not 

25 limited to, modification of the cloning, processing, and/or expression of tiie gene product. DNA 
shuffling by rauttom firagm^ation and PCR reassembly of gene fragments and synttietic 
oligonudeotides may be used to engineer tiie nucleotide sequences. For exan?)le, oligonudeotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 
alter glycosylation patterns, change codonpreferaice, produce splice variants, and so forth. 

30 The nucleotides of the present invention may be subjected to DNA shuffling techniques such 

as MOLECULARBREEDING (Maxygenlnc, Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotedmol. 17:793-797; Christians, EC. et al. (1999) Nat 
Biotechnol. 17:259-264; and Cram^. A et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
hnprove the biological properties of NTT, such as its biological or enzymatic activity or its ability to 
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bind to otba: molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
prpporties. These preferred variants may then be pooled and further subjected to recursive rounds of 
5 DNA shofEUng and selection/scre^iing. Ihus, genetic diversity is created through "artifidal** 

breeding and riqpid molecular evohition. For example, fragments of a single gene containing random 
pomt mutations may he recombined, screened, and then reshufQed until the desired properties are 
optimized. Altamativdy, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, eitha: from the same or different spedes, theareby 
10 maximizing the genetic divo-sity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another mibodiment, sequences encoding NTT may be synthesized, in whole or in part, 
using chemical methods wdl known in the art (See> e.g., Caruthers, M.H. et al. (1980) Nucldc Adds 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nuddc Adds Syn?>. Sear. 7:225-232.) Altemativdy, 

15 NTTitsdf or a fragnii^ thereof nmy be synthesized using diemicalmed^ For exan^le, pq)tide 
syndiesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Gdghton, T. (1984) Protdns. Structures and Molecular Propgties, WH Freeman, New York NY, pp. 
55-60; and Roborge, J.Y. dt aL (1995) Sdaice 269:202-204.) Automated synthesis may be achieved 
using the ABI 431 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequaice 

20 of NTT, or any part thereof, may be altered during direct synfliesis and/or combined with sequences 
from other protdns, or any part thereof, to produce a variant polypqptide or a polypqrtide liaving a 
sequence of a naturally occurring polypeptide. 

The pqptide may be substantially purified by prq)arative high performance liquid 
chromatography. (Se^ e,g., Chiez, R.M. andRZ. Regnier (1990) Methods Enzymol. 182:392-421.) 

25 The composition of the synlh^ peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Crdghton, supra> pp. 28-53.) 

In order to ©cpress a biologically active NTT, the nudeotide sequences oicoding NTT or 
derivatives thereof may be mserted mto an q)propriate expression vector, Le., a vector vMdii contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 

30 suitable host. These elements mdude regulatory sequences, sudi as eaihancers, constitutive and 
inducible promoters, and 5 ' and 3 ' untranslated regions in the vector and in polymideotide sequences 
encoding NTT. Suchdementsmay vary in thdr strength and spedfidty. Specific initiation signals 
may also be used to achieve mOTe efficient translation of sequences encoding NTT. Sudi signals 
indude the ATG initiation codon and adjacent sequ^ices, e.g. the Kdzak sequence. In cases where 
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sequeaces encoding NTT and its initiation codon and upstream regulatory sequences are inserted into 
the appropriate ^qpression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequ©tice» or a fragment thoreof, is inserted, exogaious 
translational control signals including an in-frame ATG initiation codon should be provided by the 
5 vector. Exogenous translati(nial donaits and initiation codons may be of various origins, both natural 
and synthetic. The efficiency of expression may be enhanced by the inclusion of oihancers appropriate 
for the particular host ceiQ system used. (See, e.g., Sdiarf, D. et al. (1994) Results ProbL Ceil Differ. 
20:125-162.) 

Methods which are weU known to those skilled in the art may be used to construct expression 

10 vectors containing sequ^ices encoding NTT and ^ropriate transcriptional and translational contrcd 
el^nents. These methods include m vitro recombinant, DNA techniques, synthi^c techniques, and in 
vivo graetic recombhiation. (See, e.g., Sambrook, J. et al. (1989) Molecular Clonuig. A Labaratorv 
Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausub^ F.M. et al. (1995) 
Curroit Protocols in Mdecular Bioloev . John Wiley & Sons, New YOTkNY, ch. 9, 13, and 16.) 

15 A variety of expression vector/host systems may be utilized to contain and express sequences 

encoding NTT. Hiese mdude, but are not limited to, miaroorganisms such as bacteria transformed 
with recombinant bacteriqphage, plasmid, or cosmid DNA expression vectors; yeast transfOTned with 
yeast eacpression vectors; insect cell systenoB mfected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral ejqjression vectors (e.g., cauliflower mosaic virus, CaMV, or 

20 tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal ceJl systems. (See, e.g., Sambrook, supra : Ausubel, supra : Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K et al. (1994) Proc. Natl. Acad. Sci. USA 
91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Iher. 7:1937-1945; Takamatsu, N. (1987) EMBO 
J. 6:307-31 1 ; The McGraw Hill Yearbook of Science and Tedmologv (1992) McGraw Hill, New 

25 York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. ScL USA 81:3655-3659; and 
Harrington, J.J. et al. (1997) Nat Genet 15:345-355.) Expression vectors derived fi-om retroviruses, 
adenoviruses, or heq)es or vaccinia viruses, or from various bactoial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell populatioa (See, e.g., Di 
Nicola, M. et al. (1998) Canoa: Gen. Tber. 5(6):350-356; Yu. M. et al. (1993) Proc. NatL Acad. Sci. 

30 USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGiegor, D.P. et al. 
(1994) Mol. Immunol. 31(3):219-226; and Veama, I.M. and N. Somia (1997) Nature 389:239-242.) 
Tbe invention is not limited by die host cdlenq)loyed. 

In bacterial systraos, a number of cloning and ejtpression vectors may be sdected dep^uBng 
upon the use intended for polynucleotide sequeiices encoding NTT. For example, routine cloning, 
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sabclomng, and propagaticm of polynucleotide sequences flooding NTT can be achieved using a 
multifunctianal E. coli vector sudi as PBLUESCRIPT (Stratageaie, La Jolla CA) or PSPORTl plasmid 
(Life Technologies). Ligation of sequences encoding NTT into the vector's multiple cloning site 
disrupts the lacZ gene> allowing a colorimetric saeening procedure for identification of transfcarmed 
5 bactaia containing recombinant noolecules. In addition, these vectors may be usefal for in vitro 
transcr^tion» dideoxy sequencing, single strand rescue with h^per phage, and creation of nested 
deletions in the cloned sequaica (See, e.g.. Van Heeke, G. and S.M. Sdmster (1989) J. Biol. Chem. 
264:5503-5509.) Whoi large quantities of NTT are needed, e.g. for the production of antibodies, 
vectors which direct highlevd expression of NTT may be used For example, vectors containing the 

10 strong, inducible SP6 or T7 bacteriophage pronaoter may be used. 

Yeast repression systems may be used for production of NTT. A number of vectors containing 
constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH promoters, may be 
used in the veast Saccharomvces cerevisiae or Pidbia nastoris . In addition, such vectors direct either^ 
secretion or intraceillular retention of expressed proteins and ^ble int^ation of foreign sequences into 

15 Ihe host genome for stable propagation. (See, e.g., Ansubel, 1995, supra : Bitter, G.A. et al. (1987) 
Methods EnzymoL 153:516-544; and Scoro:, CA. et al. (1994) Bio/Technology 12:181-184.) 

Plant systecQs may also be used for expression of NTT. Transoiptian of sequences encoding 
NTT may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or hi 
combination with the oiii^a leader sequaice from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 

20 Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used (See, e.g., Coruzzi, G. et al. (1984) EMBO L 3:1671-1680; Broglie. R. a al. (1984) Sciaice 
224:838-843; and Winto*, J, et al. (1991) Results Probl. Cdl Differ. 17:85-105.) These constructs can 
be mtroduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, 
e.g., The McGraw HiH Yearbook of Sdmce and Tedmoloev (1992) McGraw Hill, New York NY, pp. 

25 191-196.) 

In mammalian ceUs, a number of viral-based e3q)ression syst^nos may be utilized. In cases 
where an ad^virus is used as an e9q)ression vector, sequaices encodmgNTTmaybeligatedintoan 
adenovirus transaiption^anslation con^lex consisting of tbe late promoter and tripartite leader 
sequence. Insertion in a non-ess^otial El or E3 region of the viral genome may be used to obtain 
30 hifective virus which expresses NTT in host ceils. (See, e.g., Logan, J. and T. Sheik (1984) Proc. 
NatL Acad Sci. USA 81:3655-3659.) In addition, transcription CThanoars, such as the Rous sarcoma 
virus (RS V) enhancea:> may be used to increase eyprftssinn in mainniftHflti ha^ SV40orEBV- 
based vectors may also be used for Mgh-levd proteui expression. 
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Human artifidal duroioDSOines (HACs) may also be snployed to deliver larger fragmi^ of 
DNA than can be cxmtaioed in and expressed from a plasmid. HACs of about 6 kb to 1 0 Mb are 
constructed and delivered via convaitional ddivoy methods (liposomes, polycationic amino polymers, 
or vesides) for therapeutic purposes. (See, e.g., Harrington, JJ. et aL (1997) Nat. Gen^ 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
NTT in cdl lines is preferred. FOTexan5)le, sequaices aicodingNTTcanbetransfonnedi^^ 
ushig expression vectors which may contain viral origms of rq)lication and/or endog^us expression 
demaots and a sdectable marker geoe on the same or on a separate vector. Follomng the introduction 
of the vector, cdls may be allowed to grow for about 1 to 2 days in enriched media before being 
SMdtched to selective media. The purpose of the selectable marker is to confer resistance to a selective 
agent, and its presence allows growth and recovery of cefls whidi successfully express the introduced 
sequences. Resistant clones of stably transformed cdOs may be propagated using tissue culture 
techniques apprcpriateto the cell type. 

Any number of selection systans may be used to recover transformed ceU lines. These include, 
but are not limited to, ttie herpes simplex vhiis thymidine kinase and adenine phosphoribosyltransferase 
geaies, for use m and apr cells, respectivdy. (See, e,g., Wigler. M. et al. (1977) CeU 1 1 :223-232; 
Lowy, 1. ^ aL (1980) Cdl 22:817-823.) Also, antunetabohte, antibiotic, or herbicide resistance can be 
used as the basis for sdectton. For exanq)le, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-4 1 8 ; and and pat confer resistance to 
chlorsulfuron and phosphinotridn acetyltransferase, respectively. (See. ag., Wigler, M. al. (1980) 
Proc. NafL Acad. Sci. USA 77:3567-3570; CoIbere^Garapin, F. al. (1981) J. Mol. Biol 150:1-14.) 
Additional sdectable genes have been desaibed, e.g., trpB and hisD, which alter cdlular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Nafl. Acad. ScL USA 
85 :8047-805 1.) Visible markers, e.g., anthocyanins, greai fluorescoit prolans (GFP; Qontech), B 
glucuronidase and its substrate fi-glucuronide, or ludferase and its substrate lucifedn may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expressi(m attributable to a specific vector system. (See, e.g., Rhodes, C, A 
(1995) M^hods MoL BioL 55:121-131.) 

Although the presenoe/abseoce of marko: gaie expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For e?caii5)le, if the 
sequ^ice ^odmg NTT is inserted within a marker gene sequence, transformed cells containmg 
sequences mcoding NTT can be idaitified by the absence of marker gene functioa Altemativdy, a 
marker gene can be placed in tandatn with a sequence encoding NTT under the control of a smgle 
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promoter. Expression of the marker gene in response to Induction or sdection usually indicates 
expression of tbe tandem gene as wen. 

In gsi^al, liost cells that contain the nucleic add sequence encoding NTT and iSaat express 
NTT may be identified by a variety of procedures known to those of skill in the art. These procedures 
5 indude» but are not limited to, DNA-DNA or DNA-RNA hybridizations, PGR amplification, and 
protein bioassay or immunoassay techniques which include membrane, solution, or chip based 
tedmcflogies for the detection and/or quantification of nuddc add or protein sequences. 

Immunological methods for detecting and measuring the expres^on of NTT using dther 
specific polyclonal or monoclonal antibodies are known in the art Exan?)les of sudi techniques indude 
10 eozyn^linked unmunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
activated ceil sorting (FAGS). A two-site, monodonal-based immunoassay utilizing monodonal 
antibodies reactive to two non-interfering epitqpes on NTT is prefiared, but a competitive binding assay 
maybeen?)loyed. These and otho* assays are well known in the art. (See,e.g.,Hanq)ton,R. etal. 
(1990) Serological Methods, a Labo ratorv Mamial . APS Press. St Paul MN, Sect. IV; Coligan, JJE. 
15 al. (1997) Current Protocols in TmmiiTininprY, Greene Pub. Assodates and Wiley-Intarsdence, New 
York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana Press, Totowa NJ.) 

A wide variety of labcils and conjugation tedmiques are known by those skilled in the art and 
may be used hi various nucldc add and amino add assays. Means for producing labded hybridization 
or PGR probes for detecting sequaices rdated to pdynucleotides encoding NTT include digolabding, 
20 nick translation, end-labding, or PGR an?)lification using a labded nucleotide. Altemativdy, the 
sequences racoding NTT, or any firagm^ thereof, may be cloned into a vector for the production of 
anmRNAprcbe. Sudi vectors are known in the art, are commerdally available, and may be used to 
synthesize RNA probes inviti:o by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labded nudeotides. These procedures may be conduded using a varidy of commercially available 
25 kits, such as those provided by Ame^ham Pharmacia Biotedi, Prorata (Madison WI), and US 
Biodiemical. Suitable rq)Qrter molecules or labds which may be used for ease of detection mclude 
radionuclides, enzymes, fluoresce diemQuminesc^, or chromogenic agents, as wdl as substrates, 
cofadors, inhibitors, magnetic partides, and the like. 

Host cdls transformed with nucleotide sequences encoding NTT may be cultured under 
30 conditions suitable for the ejcpression and recovery of the protdn fi:om cell culture. The protein 
produced by a transformed cdl may be secreted cm* retained intracdlularly depending on the sequence 
and/ or the vedor used. As will be understood by those of skill in the art, expression vectors containing 
polynudeotides whidi encode NTT may be designed to contahi signal sequences whidi direct secretion 
of NTT through a prokaryotic or eukaryotic ceH manbrane. 
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Li additioii, a host cdl strain may be chosmfor its abiUty to modulate expression of the 
inserted sequences or to process the expressed protdn in die desir^ SuchmodificatioDSoftiie 
polypq)tide include, hut are not limited to, acetylation, carboxylation, glycosylaHon, phosphorylation, 
li^idadon, and acyladon. Post-translational processing which cleaves a ''prepro" or "^o" form of the 
5 protein may also be used to q)ecily protein targeting, fol^^ Different host cells 

which have specific cdlular machinery and characteristic mechanisms for post-translational activities 
(e.g!, CHO. HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
of the foreign protdn. 

10 In another embodimfflit of the invention^ naturalj mnftififtd, or rft mrnh inant m^r^pir. ^n(f 

sequences encoding NTT may be ligated to a heterologous sequ^ice resulting in translation of a fusion 
protdn in any of the aforemeiitioned host systems. For example, a diimaic NTT protein containing a 
heterologous moiety that can be recogt^zed by a commerdally available antibody may facilitate the 
screaihig of peptide libraries for inhibitors of NTT activity. Heterologous protdn and peptide moieties 

15 may also facilitate purification of fusion protdns using commerdally availab^^ Such 
moieties indude» but are not limited to, ghitathione S-transferase (GST), maltose binding protdn 
(MBP), thioredoxin (Trx). calmodulin bmdmg peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin 
(HA). GST, MBP, Trx, CBP, and 6-His enable purification of tiidr cognate fusion protdns on 
immobilized glutathione, maltose^ phenylarshie oxide» calmodulhi, and m^-didate resins, 

20 respectivdy. FLAG, c-wyc, and hmagglutinin (HA) enable immunoaffinity purification of fusion 
protdns ushig commercially available monoclonal and polyclonal antibodies tiiat specifically recognize 
tiiese epitope tags. A fusion protdn may also be engineered to contain a proteolytic cleavage site 
located between the NTT encoding sequence and the hderologous prc^ sequaice, so that NTT may 
be deaved away from tiie heterologous moiety following purificatioa Methods for fusion protein 

25 expression and purification are discussed in Ausubd (1995, supra, dx 10). A variety of commorcially 
available kits may also be iised to facilitate esqiression and purification of fusion proteins. 

In a furtha: ©Dobafim^t of the inv^on, synthesis of radiolabded NTT may be achieved m 
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protdn-coding sequences q)erably associated with the T7, T3, or 

30 SP6 promoters. Translation takes place in the pres^e of a radiolabeled amino add precursor, for 
exanq)le» ^S-methionine. 

NTT of tiie present invention or fragments thereof may be used to screen for compounds that 
specifically bind to NTT. At least one and up to a plurality of test compounds may be screened for 
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specific binding to NTT. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g., lecqptors), or smaU molecules. 

In one ^bodiment» tbe compound thus identified is closely related to the natural ligand of 
NTT, e,g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 

5 natural binding partna:. (See. e.g.. CoUgan. JJE. et al. (1991) Current Protocols in TTnTminnln^y 1(2): 
Caiapter 5.) Similarly, the compound can be closely related to the natural receptor to which NTT 
binds, or to at least a fragment of the receptor, e.g.. the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds hivolves producing appropriate cells which ejcpress NTT, either as a secreted 

10 protdn or on the cell membrane. Preferred cells indude cells from mammals, yeast, Drosophila. or E. 
c^. Cells esqpressmg NTT or cell membrane fractions which contain NTT are then contacted with a 
test compound and binding, stimulation, or inhibition of activity of either NTT of the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
15 detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound witii NTT, eiflier in 
solution or affixed to a solid support, and detecting the binding of NTT to the compound. 
Alternatively, tiie assay may detect or measure binding of a test compound in tiie presence of a 
labeled competitor. Additionally, the assay may be carried out using cen-free preparations, chemical 
20 libraries, or natural product mijctures, and the test compound(s) may be free in solution or affixed to a 
solid support 

NTT of the present invention or fragments thereof may be used to screen for compounds that 
modulate tiie activity of NTT. Such compounds may include agonists, antagonists, or partial or 
inv^se agonists. In one embodiment, an assay is performed under conditions permissive for NTT 

25 activity, wtoein NTT is combined with at least one test compound, andtheactivity of NTTinthe 
presence of a test compound is compared with tiie activity of NTT in the absence of the test compound 
A change in the activity of NTT in tiie presence of tiie test compound is indicative of a compound that 
modulates the activity of NTT. Alternatively, a test compound is combined with an in vitro or cell-free 
system comprising NTT under conditions suitable for NTT activity, and the assay is perfonoed. In 

30 dtiia: of tiiese assays, a test compound which modulates the activity of NTT may do so indirectiy and 
need not come in direct contact with the test compound. At least one and up to a plurality of test 
compounds may be screened. 

In anottier embodiment, polynucleotides ^icoding NTT or their TngmTTiaiiaii homologs may be 
"knocked out*' in an animal model system using homologous recombination in embryonic stem (ES) 
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cells. SudbLtedmiques are wdl knownln the art and are useful for the generation of animal models of 
human disease. (See, e.g., U.S. Patent Number 5.175,383 and U.S. Pat^ Number 5.767,337.) For 
Qcanq)Iei mouse ES ceiOs, such as the mouse 1 29/SvJ cell line, are derived from the early mouse embryo 
and grown in culture. The ES cells are transformed with a vector containing the gene of h^ 
5 disrupted by a marker gene, e.g., the neomycin phosphotransforasegaie (neo; C^>ecchi. M.R. (1989) 
Science 244:1288-1292). The vector integrates into the corresponding region of the host genome by 
homologous recombination. Alternativdy.homcflogous recombination takes place using the Cre-loxP 
systan to knockout a geoe of interest in a tissue- or developmental stage-specific manner (Marth, J.D. 
(1996) Clin. Invest. 97:1999-2002; Wagner, K.U et aL (1997) Nuddc Adds Res. 25:4323-4330). 

10 Transformed ES ceOs are identified and micromjected into mouse cdl blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudppregnant dams, and the 
resulting chimoic progeny are genotyped and bred to produce heterozygous or homozygous strains. 
Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding NTT may also be manipulated in vitro in ES ceils derived from 

15 human blastocysts. Human ES ceais have the potaitial to differentiate into at least ei^ sqjarate cdl 
lineages including endoderm, mesoderm, and ectodennal cdl types. These cdl lineages differentiate 
into, for example, neural cdls, hematqpoietic lineages, and cardiomyocytes (Thomson, J.A. et aL (1998) 
Sdence 282:1 145-1 147). 

Polynudeotides oicoding NTT can also be used to aeate "knoddn" humanized anhmQs (pigs) 

20 or transgedcaninaals (mice or rats) to modd human disease. Withknockintedmology, ar^onof a 
polynucleotide racoding NTT is injected into anunal ES ceQls, and the injected sequence integrates mto 
tiie animal ceaigaiome. Transformed cdls are injected into blastulae^ and the hlajsiiiiflft am im p i ^ntfid 
as described above. Transg^c progeny or inbred lines are studied and treated with potential 
pharmaceutical ag^its to obtain information on treatment of a human disease. Altemativdy, a ynsimm^i 

25 inbred to overexpress NTT, e.g., by secreting NTT in its milk, may also serve as a convenient source of 
tiiatprotdn (Janne, J. et al. (1998) Biotedmol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of NTT and neurotransmitter transporters. In addition, the e3q)ression of NTT is 
30 dosely assodated witii brain, neurological, secretory, and tumor tissue. Therefore, NTT appears to 
play a role in transport, neurological and psydiiatric disorders. In the treatment of disorders 
assodated wifli increased NTT expression or activity, it is desirable to decrease the ejqaression or 
activity of NTT. In the treatment of disordea^ assodated with decreased NTT expression or activity, 
it is desirable to increase the e3q)ression or activity of NTT. 
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Tberefore, in one embodiment, NTT or a fragment or dwivative thereof may be administered 
to a subject to treat or prevent a disorder associated with decreased expression or activity of NTT. 
Examples of such disorders mclude, but are not limited to, a transport disorder such as aidnftRifl, 
amyotrophic laterd sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's muscular dystrophy, 
5 Bdl's palsy, Charoot-Marie Tooth disease, diabetes memtus, diabetes hisipidus, diabetic neurqpathy, 
Duchenne muscular dystrophy, hyperkalemic periodic paralysis, notmokalemic periodic paralysis, 
Parkinson's disease, malignant hyperthMmia, multidrug resistance, myasthenia gravis, myotonic 
dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropattiy, cerebral neoplasms, 
prostate cancer; cardiac disorders associated with transport, e.g., angina, bradyarrythmia, 

10 tachyanythmia, hypertension. Long QT syndrome, myocarditis, cardiomyopathy, nemaline 

myopathy, centronuclear myopathy, lipid myopathy, mitodiondrial myopathy, thyrotoxic myopathy, 
ethanol myopathy, dranatomyositis, inclusion body myositis, infectious myositis, polymyositis; 
neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disordo", 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schi2q>hrenia; and othar 

15 disorders associated with transport, e.g., neurofibromatosis, postheipetic neuralgia, trigeminal 

neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery 
stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia. Grave's disease, goiter, 
Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome, 
hyp^cholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn 

20 syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a 
neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, 
Alzheimer's disease, Rck's disease, Huntington's disease, dementia, Parkinson's disease and other 
extrapyramidal disord^s, amyotrophic lateral sclerosis and other motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 

25 demyelinatiag diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural 
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central n^ous 
system disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and G^tmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 

30 encq)halotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system indudmg Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
navous system disorders, cranial nave disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral n^ous system disorders, dermatomyositis and polymyositis, 
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 

35 disordars including mood, anxiety, and schizophrenic disorders, seasonM affective disorder (SAD), 
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akathesia, amnesia, catatonia, diabetic neuropatliy, tardive dyskinesia, dystonias, paranoid psychoses, 
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, 
and familial frontotemporal dementia; and a psychiatric disordo: such as acute stress disorder, saoohol 
dependence, amphetamine dependence, anorexia nervosa, antisocial personality disorder, 
5 attention-deficit hyperactivity disorder, autistic disorder, anxiety, avoidant personality disorder, 
bipolar disorder, bord^line personality disorder, brief psychotic disonier, bulimia nervosa, cannabis 
dq)endence, cocaine dependence, conduct disorder, cyclothymic disorier, rifttirin^n, delusional 
disorder, dementia, dq)endent personality disorder, depression, dysthymic disorder, hallucinogen 
dependence, histrionic personality disorder, inhalant dependwice, manic depression, multi-infarct 
10 dementia, narcissistic personality disorder, nicotine dq)endence, obsessive^compulsive disorder, 
opioid dependmce, oppositional defiant disorder, panic disorder, paranoid personality disorder, 
phencyclidine dependence, phobia, posttraumatic stress disorder, sdiizoaffective disorder, schizoid 
p^sonality disorder, schizophrenia, sedative dependence, separation anxiety disorder, and sleep 
disord^. 

15 In another embodiment, a vector capable of expresshig NTT or a fragmait or derivative thereof 

may be administered to a subject to treat or prevent a disorder associated with decreased expression or 

activity of NTT including, but not limited to, those described above. 

In a furthea: embodiment, a conq)osition conprising a substantially purified NTT in coiqunction 

with a suitable pharmaceutical carrio: may be administered to a subject to treat or prevait a disorder 
20 associated with decreased expression or activity of NTT including, but not Ihnited to, those provided 

above. 

In still anotha: embodiment, an agonist which modulates the activity of NTT may be 
administared to a subject to treat or prevent a disorder associated with deaeased ejqMression or activity 
of NTT including, but not limited to, those listed above. 

25 In a further embodiment, an antagonist of NTT may be administered to a subject to treat or 

prevent a disorda: associated with increased expression or activity of NTT. Examples of such 
disord^s include, but are not limited to, those transport, neurological and psychiatric disorders 
described above. In one aspect, an antibody which specifically binds NTT may be used directly as an 
antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to 

30 cells or tissues which express NTT. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding NTT may be administered to a subject to treat or prevent a disorder associated with increased 
e^qpression or activity of NTT including, but not limited to, tbose described above. 
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In other einbodiments, any of the proteins, antagonists* antibodies, agonists, conopl^nraitary 
sequaices, or vectors of tbe inv^tion may be administered in combination witb other ^propriate 
therapeutic agents. Selection of the appropriate ag^its for use in combination tfaarq)y may be made by 
one of ordmary skill in the art, according to conventional pharmacaitical principles. The combination 
5 of therq)eatic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using ttiis approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the poteatial for adverse side ^ects. 

An antagonist of NTT may be produced using methods which are generally known in the art 
In particular, purified NTT may be used to produce antibodies or to scre^ libraries of pharmaceutical 

10 ag^ts to idaitify those which specifically bind NTT. Antibocfies to NTT may also be generated using 
methods that are wdl known in the art Such antibodies may include, but are not limited to, polyclonal, 
monodtonal, chuneric, and single chain antibodies, Fab fragments, and fragments produced by a Fab 
expression library. Neutralizing antibodies (ie., fijose wWdi mhibit dimer formation) are goierally 
preferred for ther^aitic use. 

15 For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 

and othffs may be immunized by iqection with NTT or with any fragment or oligopeptide thereof 
which has inmaunogaiic prq)erties. Dcpooding on the host spedes, various adjuvants may be used to 
maease immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 

20 polyanions, pq)tides, dl ©oaulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli Calmette-Guerin) and Corvnebactmum parvum are especially preferable. 

It is prefeired that the oligopeptides, peptides, or fragments used to induce antibodies to NTT 
have an amino acid sequence consisting of at least about 5 amhio acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 

25 identical to a portion of the amino add sequence of the natural protein. Short stretches of NTT amino 
adds may be fused with those of another protdn, such as KLH, and antibodies to the chimfflic molecule 
may be produced. 

Monodonal antibodies to NTT may be prepared using any tedmique whidi provides for the 
production of anfibody molecules by continuous cdl lines in culture. These indude, but are not limited 
30 to, the bybrldama tedmique, the human B-cdl hybridoma tedmique, and theEfiV-byhridoma 
tedmique. (See, e.g., Kohler, G. et aL (1975) Nature 256:495-497; Kbzbor, D. et aL (1985) J. 
Imnmnoi Methods 81:31-42; Cote, RJ. et al. (1983) Proc. Natl. Acad ScL USA 80:2026-2030; and 
Cole, S.P. et al. (1984) MoL Cdl Biol. 62:109-120.) 
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In addition, techniques developed for the production of "dibneric antibodies/' such as the 
splicing of mouse antibocty genes to human antibody genes to obtain a mcdecule with appropriate 
antigen specificity and biological activity, can be used (See, e.g., Morrison, Si. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neubargo:, MS. et al. (1984) Nature 312:604-608; and Takeda, 
5 S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, nshig metliods known in the art, to produce NTr-q)ecific single chain 
antibodies. Antibodies with related spedbadty, but of distinct idiotypic conqKeitlon, may be generated 
by chain shufOing from random combinatorial immunoglobulin libraries. (See, e.g., Burton, D.R. 
(1991) Proc. NaO. Acad. Sd. USA 88:10134-10137.) 

10 Antibodies may also be produced by inducing in vivo production in the lyn5)hocyte pqpulation 

or by screening immunoglobulin libraries or pands of hi^y specific binding reagents as disclosed in 
thelita:ature. (See, e.g.. Qrlandi, R. et al. (1989) Proc. NatL Acad. Sd. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

Antibody fragneats whidi contain specific binding sites for NTT may also be generated For 

15 example, sudi fi:agments inchide, but are not limited to, F(ab')2 fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab')2 
fragments. Alteanativdy, Fab expression libraries may be constructed to allow rapid and easy 
. identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 
(1989) Sdaice 246:1275-1281.) 

20 Various nnmunoassays may be used for screendng to identify antibodies having the desired 

specifidty. Numerous protocols for conq)etitive binding or immunoradiometric assays using dthez* 
polydonal or monoclonal antibodies with established spedfidties are well Imown in Ihe art Such 
immunoassays typically involve the measurCTent of con^leK formation betweeu NTT and its specific 
antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 

25 non-intafaing NTT q)itopes is generally used, but a competitive binding assay may also be en?)loyed 
(Pound. supraV 

Various mdhods sudi as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for NTT. Affinity is ejq)ressed as an association 
constant, K,, which is defined as the molar concentration of NTT-antibody complex divided by the 
30 molar concentrations of free antigen and free antibody under equilibrium The d^ennhied 

for a prq)aration of polydonal antibodies, whidi are heterogaieous m tiieir affinities for multiple NTT 
epitq)es, represaots the av^age affinity, or avidity, of the antibodies for NTT. The drtomined for a 
preparation of monodonal antibodies, whidi are monospedfic for a particular NTT epitope, represents 
a true measure of affinity. High-affinity antibody preparations with rangmg from about 10^ to 10^^ 
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L/mole are prefenred for use in immunoassays in vMch iht NTT-antibody con^leic must withstand 
rigorous manipulations. Low-a£Bnity antibody prq)aratioiis with ranging from about 10^ to 10^ 
L/mole are preferred for use in inmmnppuritication and similar procedures which ultimately require 
dissociation of NTT, prefaably in active form, fromtiie antibody (Catty, D. (1988) Antibodies. 
5 Volume I: A Practical Approach . IRL Press, Washington DC; Lidddl, JJE. and A. Cryex (1991) A 
Practical Guide to Monoclonal Antibodies. John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody prq)arations may be furtiier evaluated to d^ermine 
the quality and suitability of such preparations for certain downstream applications. F(x example, a 
polyclonal antibocty preparation containing at least 1-2 mg specific antibodyMl, preferably 5-10 mg 
10 specific antibody/ml, is generally casployed in procedures requiring predpitatton of NTT-antibody 
con^lexes. Procedures for evaluating antibody specifidty, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra , and 
Coligan et al. supra .) 

In another raribodiment of the invention, the polynudeotides ^icoding NTT, or any fragm^it or 

15 con^lemenC thereof, nmy be used for therq)eutic purposes. Inoneaspect, modifications of gene 

expression can be adiieved by designing com)lemMrtary sequaices or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatOTy regions of tiie gew encoding NTT. 
Sudi tedmolQgy is well known in the art, and antisense oligonudeotides or larg^ fr agmi^ can be 
designed from various locations along the coding or contrcdregiom of sequ (See, 

20 e.g., Agrawal, S., ed. (1996) Antisense Thorapeutics. Humana Press Inc., Totawa NJ.) 

In therq)eutic use, any gene delivery system suitable for introduction of the antisense 
sequences into Expropriate target ceUs can be used. Antisense sequences can be delivered 
intraceUulariy in the form of an expression plasmid which, upon transcription, produces a sequence 
complemratary to at least a portion of the cellular sequence encodhig the target protdn. (See, e.g., 

25 Slater, JJB, et al. (1998) J. Allergy Oi, Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracdlularly tiirough the use of viral 
vectors, sudi as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A,D. (1990) Blood 
76:271; Ausubel, supra : Uckart, W. and W. Walther (1994) PharmacoL Tb^. 63(3):323-347.) Otiier 
gene delivery mechanisms include liposome-derived systems, artifidal viral envelopes, and other 

30 systems known in the art. (See, e.g,, Rossi, J J. (1995) Br. Med. Bull. 5 l(l):217-225; Boado, R. J. et 
al. (1998) J. Pharm. Sd. 87(1 1):1308-1315; and Morris, M.C. et al. (1997) Nuddc Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynudeotides encoding NTT may be used for 
somatic or germline gene therapy. Gene tberdpy may be paf ormed to (i) cocrect a gaietic defidency 
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(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized hy X-linked 
inheritance (Cavazzana-Calvo, M. a al. (2000) Sdence 288:669-672), severe ccmWned 
iimnunodefidsicy syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Sdaice 270:475-480; Bordignon. C. et al. (1995) Sdence 270:470-475), 
5 cystic fibrosis ^ner. J. et al. (1993) Cdl 75:207-216; Crystal, R.G. et al. (1995) Hum. Gaie 

Thorapy 6:643-666; Crystal, R.a et al. (1995) Hum. Gaie Hierapy 6:667-703), thalassamias. familial 
hyperdiolesterolania, and hanpphilia resulting from Factor Vm or Factor IX defidendes (Crystal, 
R.G. (1995) Sdence 270:404-410; Venna. LM, and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e^g., in the case of cancers wMdi result from unregukted 

10 cefl proliferation), or (iii) esqiress a protdn v/tidi affords protection against intracdlular parasites (e.g., 
against human retroviruses, sudi as human immunodeficiency virus (HTV) (Baltimore, D. (1988) 
Nature 335:395-396; Poesdfla, E. et aL (1996) Proc. Natl. Acad Sd. USA 93:11395-11399), 
hepamis B or C virus (HBV, HC V); fungal parasites, sudi as Candida albicans and Paracocddioides 
hrasnimQiQ: and protozoan parajgites such hr Plasmodium f aldpamm and Tr Yp annsnnifl miri) Inthe 

15 case where a geai^c defidency in NTT expression ot regulation causes disease, the e3q)ression of NTT 
from an apprcpriate population of transduced cdls may alleviate the clinical manifestations caused by 
the genetic defidency. 

In a ftirther embodiment of the invention, diseases or disorders caused by defidendes in NTT 
are treated by constructing mammalian expression vectors encoding NTT and introdudng these vectors 

20 by mechanical means into NTT-defidentcdls. Medianicaltransfartedmologiesforusewithcdlsin 
vivo or ex vitro include (i) direct DNA miaoinjection into individual ceills. (ii) ballistic gold partide 
delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the use of 
DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Anna, Rev. Biochoa 62:191-217; Mcs, 
Z. (1997) Cdl 91:501-510; Boiilay, J-L. and H. R&qwn (1998) Cuir. Opia Biotechnol. 9:445-450). 

25 Expression vectors that may be effective for the ^ression of NTT include, but are not limited 

to, thePCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH«>ERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON. PTRE2, PTRE2-LUC, PTK-HYG (Clontedi, Palo Alto CA). NTT may be expressed 
using (i) a constitutivdy active promoter. (e.g., from cytom^alovirus (CMV), Rous sarcoma virus 

30 (RSV), SV40 virus, thymidmeWnase (TIQ, or p-actin genes), (ii) an indudble promoter (e.g„ tiie 
tetracycline-regulated promoter (Gossea, M. and H. Bujard (1992) Proc. Natl. Acad. Sd. USA 
89:5547-5551; Gossen, M. et aL (1995) Sdence 268:1766-1769; Rossi, F.M.V. andH.M. Blau (1998) 
Curr. Opin. Biotedmol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-indudble promotar (available in the plasmids PVGRXR and FIND; Invitrogea); the 
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FKS06/rapamycin inducible promoter; or the RU486/niif epristone inducible promoter (Ro8Si» F.M. V. 
and Blau, H.M. supra) \ or (iii) a tissue-spedfic promoter or tbe native promoter of the endog^ous 
gene encoding NTT from a normal individual. 

Commerdany available liposome transformation kits (e.g.» tbe I^RFECT LIPID 
5 TRANSEECTION KIT, available from Livitrogen) allow one witb ordinary skill in the art to deliver 
pdynucleotides to target ceSls in culture and require minimal effort to optimize experimental 
parameters. In tbe alternative, transformation is performed using tbe calcium phosphate method 
(Graham, Fi. and A. J. Eb (1973) Virdpgy 52:456-467), or by dectroporation (Neumann, E. et al. 
(1982)EMBOJ. 1:841-845). Theintroductianof DNA to primary cdls requires modification of these 

10 standardized mammalian transfection protocols. 

In another embodiment of the inv^on, diseases or disorders caused by g^aetic defects with 
respect to NTT e3q)ression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide enoodmg NTT undo: the control of an ifidepoident promoter or the retrovirus long 
terminal lepest (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 

15 d^n^ (RRE) along with additional rdrovirus ci^-acting RNA sequences and co(Sng sequences 
required for effident vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
conmiercially available (Stratagoie) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sd. USA 92:6733-6737), incorporated by referaoce herein. The vector is propagated in an 
appropriate vector producing ceill line (VPCL) that ejqjresses an envdqpe gene with a tropism for 

20 receptors on the targrt ceils or a promiscuous awdppe protdn such as VS Vg (Annemtano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A et aL (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
AD. Miner (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910.434 to Rigg ("Method for obtaining 
retrovirus packaging cdl lines producing higji transducing effidency retroviral supernatant") disdoses a 

25 m^hod for obtaining retrovirus padcagmg cdl lines and is herdjyincor^ referoice. 

Propagation of retrovirus vectors, transduction of a pcpdation of cdls (e.g., CD4* T-cdls), and the 
return of transduced ceffls to a pati^ are procedures weSl known to persons skOled in the art of gaie 
ther^y and have bemwdl documaited (Ranga, U. et aL (1997) J. Virol. 71:7020-7029; Bauer, G. dt 
al. (1997) Blood 89:2259-2267; Bon3«iadi, MX. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. 

30 (1998) Proc. Natl. Acad Sd. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative^ an ad^virus-based gene thers^y deliv^ syst^ is used to ddiver 
polynucleotides mcoding NTT to cdls which have one or more genetic abnormalities with respect to Qie 
expression of NTT. The construction and packaging of adeoovirus-based vectors are wdl known to 
those with ordinary skill in the art Rq)lication defective adeno\drus vectors have proven to be versatile 
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for importing genes encoding inmninoregulatQry protdns into intact islets in the pancreas (Csete» M.E. 
et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. 
Patent Number 5,707,618 to Araoentano C'Adenovirus vectors for gene therapy"), herdby incorporated 
by refer^ice. For adenoviral vectors, see also Antinozzi, P.A. et aL (1999) Annu. Rev. Nutr. 
5 19:51 1-544 and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, both incoiporated by 
rtferm:eherda 

In another alternative, a heatpes-based, gene therapy delivery system is used to ddiver 
polynucleotides ^coding NTT to targ^ cdls which have one or more genetic abnormalities with respect 
to the eaqpression of NTT. Hie use of herpes simplex virus (HSV)-based vectors may be espedally 

10 valuable fcff introducing NTT to cdls of the central naroussysten^ The 
construction and packaging of herpes-based vectors are weU Imown to those with ordinary skill in the 
art. A rq)lication-comp^:ent herpes sin^plex virus (HSV) type 1-based vector has beea used to deliver a 
rqxMter gene to the eyes of primates (liu, X. al. (1999) Exp. Eye Res. 169:385-395). The 
construction of a HSV-1 virus vector has also been disclosed in detail in U.S. Patent Number 5,804,413 

15 to DeLuca ("Herpes smq)lex virus strains for gene transfer"), whidi is hereby incorporated by 

rderence. U.S. Patent Number 5,804,413 teaches the use of recombinant HSV d92 which consists of a 
goiome containing at least one exogenous gene to be transferred to a cdl under the control of the 
appropriate promoter for purposes including human gene lherq)y. Also taught by this patent are the 
construction and use of recombinant HSV strains deleted far ICP4, IC3P27 and ICP22. For HSV 

20 vectors, see also Coins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et aL (1994) Dev. Biol. 
163:152-161, herd)y incorporated by reference. The manipulation of cloned herpesvirus sequences, the 
generation of recombinant virus following the transfection of mu]tq}le plasmids containing different 
segments of the large herpesvirus genomes, the growth and propagation of herpesvirus, and the 
infection of cells with heapesvirus are techniques weU known to those of ordinary skill in the art. 

25 In another alternative, an a^)havirus (positive, single-stranded RNA virus) vector is used to 

deaivea: polynucleotides aicoding NTT to target cells. The biology of the prototypicalphavirus, Semliki 
Forest Wus (SFV), has beai studied extensively and gene transfer vectors have been based on the SFV 
gwiome (Garoff, H. and K-J. Li (1998) Curr. Opia BiotedmoL 9:464-469). During alphavirus RNA 
rq)lication, a subgenomic RNA is generated that normally aicodes the viral capsid proteins. This 

30 subgoiomic RNA rq)licates to higha levels than the full length genomic RNA. resulting in the 

overproduction of cs^sid protdns rdative to the viral protdns with enzymatic activity (e.g., protease 
and pdynMase). Similarly, inserting the coding sequence for NTT into the alphavirus genome in place 
of the capsid-coding region results in the production of a large number of NTT-coding RNAs and the 
synthesis of high levds of NTT in vector transduced cells. While alphavirus infection is typically 
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associated with cdllysis witiiin a few days, tlie ability to establish a persists 
nonnal kidney cdls (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic replication 
of alphaviruses can be altered to suit the needs of the gene ther^y application (Dryga, S. A. et al. 
(1997) Virology 228:74-83). The wide host range of alphaviruses will allow the mtroduction of NTT 
into a variety of cell types. The specific transduction of a subset of ceils in a population may require 
the sorting of ceiDs prior to transductioa Tbeniethods of manipulatmg infectious cDNA clones of 
alphaviruses, performing alphavirus cDNA and RNA transfections, and perfonning alphavirus 
infections, are wdl known to those with ordinary skill in the art 

Oligonucleotides derived from the transcription initiation site, &g., between about positions -10 
and +10 frona the start site> may also be ai5>loyed to inhibit gene e3q)ressioa Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple hdix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Rec^ therapeutic advances using triplex DNA have 
be^ desoribed in the literature. (See, e.g.. Gee, J£. et aL (1994) in Huba:, B.E. and B.L Carr, 
Molecular an d rmmiinnl ogic Approaches. Futura Publishing, Ml Kisco NY, pp. 163-177.) A 
con^lemaotary sequsice or antis^e molecule may also be designed to block translation of mRNA by 
prevaiting the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The me c h anis m of ribozyme action involves sequmce-spedfic hybridization of the ribozyme 
molecule to con5)lanentarytargrt RNA, foflowed by endonucleoSlytic cleavage* Forexan5)le, 
epgineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequ^ices encoding NTT. 

Specific ribozyme deavage sites within any potaitial RNA target are initially identified by 
scanning the targ^ molecule for ribozyme deavage sites, induduig the following sequaices: GUA, 
GUU, and GUC. Once identified, short RNA sequ^ices of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site^ may be evaluated for 
secondary structoral features whidi may render the oligonucleotide moperable. The suitability of 
candidate targets may also be evaluated by testtag accessibility to hybridization with complementary 
oligonudeotides using ribonudease protection assays. 

Complementary ribonuddc add molecules and ribozymes of ttie invention may be prepared by 
any method known in the art for the synthesis of nuddc addmcdecules. These include tedmiques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chcanical synthesis. 
Alternativdy, RNA molecules may be gaierated by invibro and in vivo transcription of DNA sequences 
eocodtogNTT. SudiDNAsequencesmaybeincoqKratedintoawidevariety of vedors withsui 
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RNA polymerase promoters sudi as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA. constitutlvdy or indudbly, can be introduced into cdl lines, ceJls, or tissues. 

RNA molecules may be modified to increase intraceQular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends 
5 of the molecule, ortheuseofphosphorolMoateor 2'0-niethylratherlhanpliospta 

within the backbone of the mdecule. This except is inherent in Ibe production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and 
wybutosine* as wdl as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

10 An additional embodiment of the invention encompasses a method for screening for a 

compound which is effective in altering expression of a polynucleotide encoding NTT. Compounds 
which may be effective in altmng expression of a specific polynucleotide may mclude, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 

15 chemical entities which are cq)able of hiteracting with specific polynucleotide sequaices. Effective 
compounds may alter polynucleotide e:q)ression by acting as eithra" inhibitors or promoters of 
polynucleotide ejqMcession. Thus, in the treatment of disorders associated with increased NTT 
repression or activity, a compound which specifically inhibits repression of the polynucleotide 
encoding NTT may be then5)eutically useful, and in the treament of disorders associated with 

20 decreased NTT expression or activity, a compound which specifically promotes expression of the 
polynucleotide encoding NTT may be therapeutically useflil. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
conmionly known in the art, including chemical modification of a compound known to be effective m 

25 altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a con5M)und 
based on chemical and/or structural properties of the target polynucleotide; and selection fi-om a 
library of chemical compounds aeated combinatorially or randomly. A sample comprising a 
polynucleotide encoding NTT is exposed to at least one test compound thus obtained. Ihe sample 

30 may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding NTT are assayed by 
any method commonly known in the art. TypicaUy , the expression of a specific nucleotide is detected 
by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encodmg NTT. The amount of hybridization may be quantified, thus forming the 
35 basis for a comparison of the expression of the polynucleotide both with and without e;qposure to one 
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or more test compounds. Detection of a change in ttie ejqjression of a polynucleotide e^qposed to a 
test confound indicates lhat the test compound is effective in alt^±tig the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example* using a Schizosaccharomvces pombe gene expression system (Atkins, 
5 D. et al. (1999) U.S. Patent No. 5,932,435 ; Amdt, GM. et al. (2000) Nucleic Acids Res. 28:E15) or a 
human cell line such as HeLa cdl (Qarke, M.L. et al. (2000) Biochan. Biophys. Res. Commun. 
268:8-13). A particular embodiment of the present invention involves screening a combinatorial 
library of oligonucleotides (sudi as deoxyribonudeotides. ribonucleotides, peptide nucleic acids, and 
modified oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bruice, 

10 T.W. et al. (1997) U.S. Patent No. 5,686.242; Bruice, T.W. et al. (2000) U.S. Patent No. 6.022,691). 

Many methods for introducing vectors mto cdls or tissues are available and equally suitable for 
usetej352,injvtoo,anda\4vo. For aydvoteapy. vectors may be hitioduced into stem cdls tak^ 
from the patient and clonally propagated for autologous transplant back into that same patieoL 
Ddivery by transfection, by liposome mjections, or by polycationic amiiK) polymers may be achieved 

15 using methods which are wdl known in the art (See, e.g., Goldman. C.K. et al. (1997) Nat 
Biotechndl. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, mcludmg, for exan?)le, mammals sudi as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

20 An additional embodunait of the invojtion relates to the administration of a conqxjsrtion which 

generally conprises an active higredient formulated witii a pharmaceutically accq)table excqpient 
Excipients may include, for exan5)le, sugars, starches, ceUuloses, gums, and protdns. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Rmuagl^ 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of NTT, 

25 antibodies to NTT, and mimetics, agonists, antagonists, or hihibitors of NTT. 

The compositions utilized in this invention may be administered by any nunaber of routes 
including, but not limited to, oral, infravwious, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventiicular, pulmonary, transdermal, subcutaneous, intraperitoneal, hUranasal, enteral, topical, 
sublingual, or rectal means. 

30 Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

These compositions are g^ierally aerosolized immediatdy prior to inhalation by the pati^ In the case 
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting 
formulations is well-known in the art. In tiae case of maaomolecules (e.g, larger peptides and proteins), 
recent developments in the field of pulmonary delivery via the alveolar region of the lung have enabled 
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the practical delivery of drags such as insulin to blood circulation (see, e.g.. Patton. J.S. et aL, U.S. 
Patait No. 5,997,848). Pulmonary delivery has the advantage of administration without needle 
iqection, and obviates the need for potentially toxic p^ietration oflianc^s. 

Conq)ositions suitable for use in the invention include conipositions wherein the active 
5 ingredients are contained in an effective amorat to acMeve the inta The determination of 

an effective dose is well within the capability of those skilled in the art 

Specialized forms of compositions may be prqpared for direct intracdlular ddivery of 
maaomdecules con5)rising NTT or fragments thereof. For example, liposome preparations containing 
a cdQ-impermeable maaomoLecule may promote cell fusion and intracellular delivery of ttie 

10 macromolecule. Altemativdy, NTT or a fragment thereof may be joined to a short cationic N-tmninal 
portion from the HIV Tat-1 proteia Fusion proteins thus generated have been found to transduce into 
the cdls of all tissues, including the brain, in a mouse model systmi (Schwarze, S.R, al. (1999) 
Sdeaice 285:1569-1572). 

Fcm: any compound, tlie therapeutically effective dose can be estimated initially dther in cell 

15 culture assays, ag., of neoplastic cells, or in animal modds sucb as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and route 
of admmistratioa Sucii information can &en be used to detennine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example NTT or 

20 fragments thereof, antibodies of NTT, and agonists, antagonists or inhibitors of NTT, which 
ameliorates the syn?)toms or conditioa Hierapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cdl cultures or with eq}erimental animals, such as by 
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (thedose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is tiie 

25 tha*25)eutic index, wMdi can be repressed as the LD50/ED50 ratio. Compositions which exhibit large 
therjqpeutic indices are preferred. The data obtained from ceil culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such con?)ositions is 
preferably witiiin a range of circulating concentrations ttiat includes the EDso witii litfle or no toxicity. 
The dosage varies within this range depaiding upon tiie dosage form employed, the sensitivity of the 

30 pati^ and the route of administration. 

The exact dosage win be d^ermined by the practitioner, in light of factors related to tiie subject 
requiring treatmait Dosage and admhiistration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired ^ect. Factors wMchnmy be takeoi into account include the seventy 
of flie disease state, the geno-al health of the subject, the age, weight, and gender of the subject, time 
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and frequency of administration, drug combinatiCffl(s), reaction sensitivities, and response to tha-apy . 
Long-actiog con?)ositions may l)e administered every 3 to 4 days, every week, or biweddy depending 
on the half-life and clearance rate of the particular f onnulatioa 

Nonnal dosage amounts may vary from about 0.1 /^g to 100,000 /zg, iq) to a total dose of 
5 about 1 gram, dq)ending upon the route of administratioa Guidance as to particular dosages and 
methods of deaivoy is provided in the literature and generally available to practitioners in the art 
' Those skilled in the art will employ different formuiatiQns for nucleotides than for protdns or their 
inhibitors. Similarly, ddivery of polynucleotides on polypeptides will be specific to particular cdls, 
conditions, locations, etc, 
10 DIAGNOSTICS 

In anotho: embodimant, antibodies which specifically bind NTT may be used for the diagnosis 
of disorders characterized by expression of NTT, cmt in assays to monitor patients bdng treated with 
NTT or agonists, antagonists, or inhibitors of NTT. Antibodies useful for diagnostic purposes may be 
prqpared m the same manner as described above for therapeutics. Diagnostic assays for NTT include 

15 methods which utilize the antibody and a label to detect NTT in human body fluids or in extracts of 
cells or tissues, llie antibodies nmy be used with or vwthout modification, and niay be labded by 
covalent or non-covalent attachmait of a rqxirta: uKdecule. A wide variety of rq)Qrter molecules, 
several of whidi are described above, are known in the art and may be used. 

A variety of protocols fac measuring NTT, including ELISAs, RIAs, and FACS, are known m 

20 the art and provide a basis for diagnosiog altered or abnonnal levels of N^ NcrmalOT 
standard values for NTT expression are established by combining body fluids or ceil extracts taken 
from normal mammalian subjects, for exanqde, human subjects, with antibodies to NTT under 
coiKlitions suitable for conq)lexfQrniatioa TbeanK)umof standard conq)lex formation may be 
quanfitated by various methods, such as photometric means. Quantities of NTT expressed in subject, 

25 confrd, and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnnRing disease. 

In another embodimait of the invention, the polynucleotides encoding NTT naay be used for 
diagnostic purposes. The polynucleotides which may be used hiclude oligonucleotide sequences, 
con^flemaatary RNA and DNA molecules, and PNAs. The pdynucleolides may be used to detect and 

30 quantify gene ejpression in biopsied tissues in which ejqpression of NTT may be corrdated with 
disease. The diagnostic assay may be used to determine absence, presence* and excess expression of 
NTT, and to monitor regulatiCMi of NTT levels during therapeutic interventioa 

lu one aspect, hybridization with PGR probes which are enable of detectmg polynucleotide 
sequences, indudmg genomic sequences, ©ocoding NTT or closely related molecules may be used to 
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identify nuddc add sequences wMdi oioode NTT. The specificity of tbe prdbe, whetbo: it is made 
from a highly specific region, eg., the 5'regulatory region, or from a less specific region, e.g., a 
conserved motif, and tiie stringwicy of the hybridization cm: an?)]ification win determine whether the 
probe identifies only naturally occurring sequaices encoding NTT, allege variants, or related sequences. 
5 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of theNTT mcoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be doived from the sequence of SEQ ID NO:7-12 or from 
goiomic sequQDces including promoters, aihancers, and introns of the NTT gene. 

Means for producing specific hybridization probes for DNAs aicoding NTT include the 
10 donmg of polynucleotide sequwices encoding NTT or NTT derivatives into vectors for the production 
of mRNA probes. Such vectors are Jmown in the art, are commarcially available^ and may be used to 
synthesize RNA probes inj^ by means of the addition of the appropriate RNA polymerases and the 
appropriate labded nucleotides. Hybridization probes may be labded by a vari^ of reporter groups, 
for example, by radionucHdes such as or or by enzymatic labds, sudi as alkaline phosphatase 
15 coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences OKXding NTT may be used for the diagnosis of disorders associated 
with e3q)ressiQn of NTT. Exan[?)les of such disorders indude, but are not lunited to, a transport 
disorder sudi as akinesia, amyotrophic lat^ sderosis, ataxia telangiectasia, cystic fibrosis, Becker's 
muscular dystrophy, Bdl's palsy, Charcot-Maiie Tooth disease, diabetes mellitus, diabetes msipidus, 
20 diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic 
periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia 
gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, pmpheral neuropathy, cerebral 
neoplasms, prostate canc^; cardiac disorders associated with transport, e.g., angina, bradyanythmia, 
tadiyarrythmia, hyp^nsion. Long QT syndrome, myocarditis, cardiomyopafliy, nemaline 
myopattiy, centronuclear myopathy, lipid myopathy, mitochondrial myopattiy, tiiyrotoxic myopattiy, 
ethanol myopathy, dOTnatomyositis, inclusion body myositis, infectious myositis, polymyositis; 
n^irologlcal disorders associated with transport, e.g., Alzhdmar's disease, amaiesia, bipolar disorder, 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and sdiizpphrenia; and ottier 
disorders assodated with transport, e.g.. neurofibromatosis, postiierpetic neuralgia, trigemmal 
neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery 
stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia. Grave's disease, goit^, 
Cushmg^s disease, Addison's disease, glucose-galaaosemdabsoiption syndrome, 
hypercholesta:olemia, adrenoleukodystrophy,'2:ellweger syndrome, Menkes disease, occipital horn 
syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a 
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neurological disorto such as epilepsy, isdiemic cerebrovascular disease, stroke, cerebral neoplasms, 
Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 
^ctrapyramidal disorders, amyotrophic lateral sd^osis and oth^ motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and olher 
5 demydinatuig diseases, bactmal and viral meningitis, brain abscess, subdural empyema* epidural 
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous 
system disease, prion diseases including kuru, Qeutzfeldt- Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous scl»x>sis, cerebelloretinal hemangioblastomatosis, 

10 eQcq)halotrigeminal syndrome, mental retardation and other developmental disorders of tiie central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, aanial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, 
inhaited. metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 

15 disorders including mood, anxiety, and schizophrenic disord^s, seasonal affective disorder (SAD), 
akafliesia, anmesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 
postherpetic neuralgia, Tourette's disordo:, progressive supranudear palsy, corticobasal degeneration, 
and familial frontotemporal dementia; and a psydiiatdc disorder such as acute stress disorder, alcohol 
dq)endence, amphetamine dependence, anorexia nervosa, antisocial personality disorder, 

20 attention-deficit hyperactivity disorder, autistic disorder, anxiety, avoidant personality disorder, 
bipolar disorder, borderline po-sonality disorder, brief psychotic disorder, bulimia nervosa, cannabis 
dependence, cocaine dependence, conduct disorder, cyclothymic disorder, delirium, deliisional 
disorder, dementia, dependent personality disorder, depression, dysthymic disorder, hallucinogen 
dependence, histrionic personality disorder, inhalant dependence, manic depression, multi-infarct 

25 dementia, narcissistic personality disorder, nicotine dependence, obsessive-compulsive disorder, 
opioid dependence, oppositional defiant disorder, panic disorder, paranoid personality disorder, 
phencyclidine d^ndence, phobia, posttraumatic stress disorder, schizoaffective disorder, schizoid 
personality disorder, schizoptirenia, sedative dependence, separation anxiety disorder, and sleep 
disorder. The polynucleotide sequeuces encoding NTT may be used in Southern or northern analysis, 

30 dot blot, or other membrane-based technologies; in PGR technologies; in dipstick, pin, and multiformat 
ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to detect altered NTT 
expression Such qualitative or quantitative methods are well known in tiie art. 

In a particular aspect, the nucleotide sequences encoding NTT may be useful in assays that 
detect the preseice of associated disorders, particularly those meitioned above. The nudeotide 

35 sequences eicoding NTT may be labded by standard methods and added to a fluid or tissue sample 
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fram a patient under conditions suitable for the formation of hybridization complexes. After a suitable 
incubation period, tbe sanqile is washed and the signal is quantified and compared with a standard 
value. If the amount of signal in the patient sample is significantly altered in con?)arison to a control 
san5)le Hnsa the presaice of altered levds of nucleotide sequences encoding NTT m flie smplt indicates 
5 the presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a 
particular Iher^^ajtic treatment regimen in animal studies, in dmical trials, or to monitor the treatment 
of an individual patient 

In order to provide a basis for the diagnosis of a disorder associated with expression of NTT, a 
normal or standard profile for e3q)ression is established. This may be accomplished by combining body 

10 fluids or cell extracts taken firom normal subjects, either animal or human, with asequeoce, or a 

fragment thereof, oocoding NTT, under conditions suitable for hybridization or amplificatioa Standard 
hybridization may be quantified by comparing the values obtained from normal subjects with values 
from an experiment in whidi a known amount of a substantially purified polynucleotide is used. 
Standard values obtained in this manner may be con?)ared with values obtained from samples from 

15 patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 

Once the presence of a disorder is established and a treatment protocd is mitiated, 
hybridization assays may be rq)eated on a regular basis to determine if the level of expression in the 
patient begins to approxunate that which is observed in the normal subject. Hie results obtained from 

20 successive assays may be used to show the efficacy of freatm^t over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transaipt (either under- or 
ovarexpressed) in biopsied tissue from an mdividual may indicate a predisposition for the devdppment 
of the disease, or may provide a means for detecthig the disease pricM: to the appearance of actual 

25 clinical symptoms. A more deflnitive diagnosis of this type may allow health professionals to eii5)loy 
preventative measures or aggressive treatment earlier thereby preventu^g the develcpment or fiffflier 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding NTT may 
involve the use of PGR. These oligomars may be Ghenolcally synthesized, generated enzymatically, or 

30 jH-oducedJnjdfro. OUgomers win preferably contain a fragment of a pdynucleotideei^^ 
a fragmoQt of a polynucleotide complementary to the polynucleotide encoding NTT, and will be 
en^iloyed undo: optunized conditions for idaitification of a specific gene or conditioa Oligomo^ may 
also be employed under less stringent conditions for detection or quantificatioii of closely rdated DNA 
or RNA sequences. 
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In a particular aspect, oligonucleotide primers denved from the polynucleotide sequences 
encoding NTT may be used to d^ect single nucleotide polymorphisms (SNPs). SNPs are substitutions, 
insertions and deletions that are a frequent cause of intierited or acquired genetic disease in humans. 
M^ods of SNP detection include, but are not limited to, single-stranded conformation polymorphism 
5 (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from the 
polynucleotide sequences encoding NTT are used to amplify DNA using the polymerase chain reaction 
(PGR). The DNA may be derived, for exanq)le, from diseased ch: normal tissue, biopsy samples, bodfly 
fluids, and the like. SNPsintheDNAcausediffer^icesinthesecondary and tertiary strictures of P^ 
products in singje-stranded form, and these differaices are detectable using gd electrophoresis in non- 
10 denaturing gds. In fSCCP, the oligonucleotide primers are fluorescenfly labeled, which allows 
detection of th& amplimers in high-througl^ut equipment such as DNA sequencing mflrh<nft<? 
Adcfitionally, sequaice database analysis mdhods, tamed in silioo SNP (isSNP), are cq)able of 
idaitifyUig polymorphisms by coniparing the sequ^ce of individual overlapping DNA fragments which 
assemble into a common consensus sequencew These con;>uter-based methods filter out sequence 
15 variations due to laboratory prq)aration of DNA and sequencing errors using statistical models and 
automated analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and 
characterized by mass spectrometry using, for exan5)le, the high tiiroughput MASSARRAY system 
(Sequenom, Inc., San Diego CA). 

Mettiods which may also be used to quantify the expression of NTT include radiolabeiing or 
20 biotinylating nucleotides, coamplification of a control nucleic add, and interpolating results from 

standard curves. (See, e.g., Mdby, P.C, et al. (1993) J. Immunol. Mettiods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochan. 212:229-236.) The speed of quantitation of mult?)le samples may be 
accelerated by running the assay in a high-tiiroughput format whore the oligomer or polynucleotide of 
interest is presCTled in various dilutions and a spectrophotometric or colorimetric response gives rs^id 
25 quantitation. 

In further embodim^, oaigonudeotides or longer fragment^ derived from any of the 
polynucleotide sequences described herein may be used as doneots on a microarray. The microairay 
can be used in transaipt imaging techniques which monitor the relative e!q)ression levels of large 
numbers of genes simultaneously as described below. The miaoarray may also be used to identify 
30 genetic variants, mutations, and polymorphisms. This infonnation may be used to determine gene 
function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in tiie treatment of disease. In particular, this information may be used 
to devdqp a pharmacogeaomic profQe of a patient in order to sdect the most appropriate and effective 
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treatmi^ regiineD for that patient. For exaiq}le» ther apeulLc agents which are highly ea^ective and 
display tbt fewest side dfects may be selected for a patient based on his/her phannacogeDDndc profile. 

In another enobodiinent, NTT, fragments of NTT, or antibodies specific for NTT may be used 
as elements on a miCToairay. The mioroarray may be used to monitor or measure proteui-protdn 
5 interactions, drug-target intaracttons, and gene expression profiles, as described above. 

A particular onbodiment rdates to the use of the polynudeotidaj of Ibe present invention to 
gMierate a transo^ image of a tissue or cdl type. A transcript hnage rq)resents the global pattern of 
gene esqwression by a particular tissue or cedl type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under givm conditions and at a 

10 givaitinae. (See Sdlhama: et aL, "Con5)arative Gene Transcript Analysis," U.S. Patent Numba- 
5,840,484, ejqpressly incorporated by reference herdn.) Thus a transcript image may be generated by 
hybridizing the polynuclet^des of tbe present invCTfion or their con5)lanents to tbe totality of 
transcripts or reverse transcripts of a particular tissue or centyp^ Inoneembodim^ the 
hybridization takes place m high-throughput format, wherein the polynucleotides of the pres^ 

15 invention or their complonents con5)rise a subset of a plurality of dements on a microarray. The 
resultant transcript image would provide a profile of g^ activity. 

Transcript images may be goierated using transcripts isolated from tissues, cdl lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene eqaression in vivo, as in the 
case of a tissue or biopsy sample, or in vitro , as in the case of a ceU line. 

20 Transcript images which profile the expression of the polynucleotides of the present invention 

may also be used in conjunction with in vitro modd systems and preclinical evaluation of 
pharmaceuticals, as wdl as toxicological testing of industrial and naturally-occurring enviromnental 
conqwunds. All compounds induce characteristic gene expression patterns, frequenfly teamed 
DKilecular fingerprints or toxicant signatures, which are indicative of niechanisms of action and toxicity 

25 (Nuwaysir, E.F. et al. (1999) Mol. Cardnog. 24:153-159; Stdner, S. and N.L. Anderson (2000) 

Toxicol Lett. 112-1 1 3:467-471, expressly incorporated by referoice herdn). If a test compound has a 
signature similar to that of a cQn?)ound witii known toxicity, it is likdy to share those toxic properties . 
These fingerprints or signatures are most useful and r^ned whai ibsy contain expression information 
firom a large numbto of genes and gem families. Ideally, a genome-wide measurement of esqpression 

30 provides tiie highest quality sipature. Even g^es whose ejqnression is not altered by any tested 
compounds are unportant as wdl, as the levels of e3q)ression of tiiese genes are used to normalize the 
rest of the expression data. The normalizatton procedure is usdiil for oon^)arison of expression data 
after treatment with different compounds. Whiletheassignmentof gene fimction to dements of a 
toxicant signature aids in inteq>retation of toxicity mechanisms, knowledge of gene function is not 
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necessary for the statistical matching of signatures wbidi leads to prediction of toxicity. (See, for 
example^ Press Release 00>02 from tlie National Institute of Environm^tal Healtii Sci^ices, released 
February 29, 2000, available at littpy/wvw.niete.nih.gov/oc/news/taxc^^ Therefore^ it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
5 sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
containing nucMc acids with the test compound. Nucleic adds that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynudeotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 

10 invOTtion may be quantified. The transcript levels m the treated biological sample are compared with 
levels iu an untreated biological sample. Differences in the transcript levels betweeai the two samples 
are indicative of a toxic response caused by the test compound in the heated sample. 

Another particular embodiment relates to the use of the polypq)tide sequences of the present 
invention to analyze the proteome of a tissue cm: cell type. The termproteome refers to the global 

15 pattern of protein expression in a particular tissue or ceill type. Eadi protdn coiiq)onent of a proteome 
can be subjected individually to further analysis. Proteome expression patterns, or profiles, are 
analyzed by quantifying the number of expressed proteins and their rdative abundance under given 
conditions and at a givai time, A profile of a cdl's proteome may thus be generated by sq)arating and 
analyzmg the polypq)tides of a particular tissue or cdl type. In one embodimoit, the separation is 

20 adiieved using two-dimensional gd dectrophoresis, in which proteins firom a sanqple are separated by 
isodectric focusmg m the first dimaision, and thai according to molecular wdght by sodium dodecyl 
sulfate slab gd dectrophoresis hi the second dunension (Steiner and Anderson, sunraV Theprotdnsare 
visualized in the gd as discrete and uniqudy positioned spots, ^ically by stainmg the gd with an agent 
such as Coomassie Blue ot silver or fluoresceat stams. The optical density of eadi protein spot is 

25 geaofirally proportional to the levd of the protem m the san5)le. The optical daisities of equivalentty 
positioned protein spots from different samples, for example^ from biological samples either treated or 
untreated with a test con^jound therapeutic ag^ are con^pared to id^itify any dianges in protean 
spot density rdated to the treatment The protdnsm the spots are partially sequaK:edushig, for 
exanaple, standard mdhods employing diemical or enzymatic cleavage followed by mass spectromdry. 

30 The idoitity of the proldn in a spot may be determined by con5)arii^git^ 

at least 5 contiguous amino add residues, to the pQlypq)tide sequences of the present inveaotioa In 
some cases, fiirther sequ^ice data may be obtained for definitive protein identification. 

A proteomic profile may also be graerated usmg antibodies spedfic for NTT to quantify the 
levels of NTT expression. In one embodiment, the antibodies are used as danoits on a microarray, and 
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protdn escpressioa levels are quantified by exposing tbe mic^oarray to the sample and detecting fbe 
levels of protein bound to each array desnent (Luddng, A et al. (1999) Anal. Biochan. 270:103-111; 
Mendoze* L.a et aL (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
methods known in the art, for exainple* by reacting the protdns in the san9)le with a thiol- or amino- 
5 reactive fluorescent con^wund and detecting the amount of fluorescence bound at each array dement 

Toxicant signatures at the proteome levd are also useM for taxicdogical screening, and should 
be analyzed in paraDd with toxicant signatures at the transcaiptlevd. Thareis apoorcorrdation 
betwe^ transcript and protein abundances for some protdns in some tissues (Anderson, Ni. and J. 
Sdmamffl" (1997) Hectrqpharesis 18:533-537), so proteome toxicant signatures may be useful in the 

10 analysis of con^xninds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transorlpts in bo(fy fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more rdiable and informative in such cases. 

In anotho: embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing protdns with the test compound. Proteins that are expressed in the treated biological 

15 sample are sqparated so that the amount of eadiprotdtt can be quantified. The amount of each protean 
is compared to the amount of the corresponding protean in an untreated biological sample. A difference 
in the amount of protdn between the two sanq)les is indicative of a toxic response to the test cQm)ound 
in the treated sampla Individual protdns areidoitifiedby sequendng the amino add residues of the 
individual proteins and comparing these partial sequences to the polypeptides of the present invention. 

20 In another embodiment the toxidty of a test con?)ound is assessed by treating a biological 

san?)le containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypq}tides of the present inv^oa Ihe amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is con^pared with 
the amount in an untreated biological sample. A diffarence in the amount of protdn between the two 

25 samples is indicative of a toxic response to the test conopound in the treated sanqde. 

Nficroarrays may be prepared, used, and analyzed using mi^hods known in the art (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Sdiena, M. et al. (1996) Proc. Natl. Acad Sd. 
USA 93:10614-10619; Baldeschwdler et sd. (1995) PCT application W095/25 11 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Hdler, R.A. et al. (1997) Proc. Nati. Acad. Sd. USA 94:2150- 

30 2155; and Hdler, M.J. et aL (1997) U.S. Patent No. 5,605,662.) Various types of microairays are wdl 
known and thoroughly described in DNAMicroarravs: A Practtcal Approach. M. Schena, ed (1999) 
Oxford University Press, London, herd)y expressly incorporated by reference. 

In another embodimfint of the inv^on, nuddc acid sequences encoding NTT may be used to 
geoerste hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
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coding or noncodiBg sequences may be used, and in some instances, noncoding sequences may be 
preferable ov^ coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mq^ping* The sequences may be ms^yped to a particular (^omosonie, to aspect 
5 chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bactmal artificial dbromosomes (BACs), bacterial PI 
constructions, or single c±iromosome cDNA libraries. (See, e*g., Harrington, J.J, et aL (1997) Nat 
Goiet 15:345-.355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trrads 
7:149-154.) Once mapped, the nucldc add sequences of the invoition may be used to develop genetic 

10 linkage maps, for example, which corrdate the inhadtance of a disease state witii the inheritance of a 
particular diromosome region or restriction fliagment length pcflymoiphism (RFLP). (See, for 
example, Lander, E.S. and D. Botstein (1986) Proc. Nati, Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (PISH) may be corrdated witii otiier physical and genetic map 
data. (See, e^g., Hdnz-Ulridi, et al. (1995) in Meyers, supra> pp. 965-968.) Examples of genetic map 

15 data can be found hi various sdoitific journals or at tiie Online Menddian Inheritance hi Man (OMIM) 
World Wide Wd) site. Cosrrdation between the location of the gene encoding NTT on a physical map 
, and a specific disorder, or a predisposition to a specific disorder, may hdp define the region of. DNA 
associated with that dis(»:der and tiius may furtiier positional donmg efforts. 

Ihsitu hybridization of chromosomal prq)arations and physical mapping tediniques, sudi as 

20 linkage analysis using established dbcromosomal markers, may be used for extending genetic maps. 
Often the placemont of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers evai if the exact chromosooDMd locus is not kn^ This information is 
valuable to investigators searchmg for disease gexies using positional cloning or ottier gene discovoy 
tediniques. Once the gene or genes responsible for a disease or syndrome have been cruddy localized 

25 by gendic linkage to a particular genomic region, e.g., ataxia-tdangiectasia to 1 lq22-23, any sequaices 
mappmg to that area may represent assodated or regulatcffy genes for further investigatioa (See, e.g., 
Gatti, R. A et al. (1988) Nature 336:577-580.) The nudeotide sequence of the tostant hrvention may 
also be used to detect diffarences in the diromosomal location due to translocation, inversion, etc., 
among nomal, carrier, or affected individuals. 

30 ' hianottifireambodhnentaf11iemv^on,NTT,itscatdyticarhn^ 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drag 
screening tediniques. The fragment employed in sudi screening may be fi-ee in solution, affixed to a 
solid support, borne on a cdl surface, or located intracdlularly. The formation of bindmg complexes 
betwe^ NTT and the agent bdng tested may be measured. 
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Another technique for drag screeniiig provides for high tiirougl^mt screening of con5>ounds 
having suitable binding affinity to tiie protein of inteest (See, e.g.. Geysoi, et aL (1984) PCT 
application WO84/03564.) In this method, large numbers of differeant small test compounds are 
synthesized on a solid substrata The test conq)ounds are reacted with NTT, or fragments thftrftnf ^ aufi 
5 washed. Bomid MT is theu detected by mettwds wen Imown in tb^ Purified NTT can also be 
coated directly onto plates for use in the aforem^oned drag screening techniques. MternativeLy, 
non-naitralizing antibodies can be used to capture the peptide and immobilize it on a solid support 

In another snbodiinent, one may use comp^itive drag screening assays in which neutralizing 
antibotfies capable ofbinding NTT spedficallyconq)^ with a test compound f^ In this 

10 manner, antibodies can be used to detect the presence of any pq)tide whidi shares one or more antigenic 
determinants with NTT. 

In additional embodiments, the nucleotide sequoices yMch encode NTT may be used in any 
molecular biology techniques that have y^ to be devdoped, provided the new techniques rdy on 
properties of nucleotide sequences that are curraitly known, including, but not limited to, such 
15 properties as the triplet genetic code and specific base pair interactions. 

Without iurth^ elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest ejctent The following embodiments, are, 
therefore, to be construed as m^ely illustrative, and not limitative of the remainder of the disclosure 
in any way whatsoever. 

20 The disclosures of all patents, applications and publications, mentioned above and bdow, 

including U.S. Sor. No. 60/205,518, U.S. Ser. No. 60/213,956, U.S. Ser. No. 60/215,105, U.S. Ser. 
No. 60/218,947, and U.S. Ser. No. 60/221,488 are ©q)ressly incorporated by reference herein. 

EXAMPLES 

25 L Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described m the LIFESEQ GOLD database 
(Incyte Genomics, Palo Alto CA) and shown in Table 4, colunm 5. Some tissues were homogenized 
and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a 
suitable mixture of denaturants, such as TREOL (Life Technologies), a monophasic solution of phenol 

30 and guanidine isothiocyanate. The resulting lysates were centrifuged over CsQ cushions extracted 
with chlorofann. RNA was precipitated from the lysates with either isopropanol or socJium acetate and 
etbanol, or by other routine methods. 

Ph^l esKtraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated 
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using Qligo dCI^-coupled paramagneflc particles (Promega), OLIGOTEX latex particles (QIAGEN, 
aiatswarthCA),QranOUGOTEXinRNApurific^^ ALtemalivdy, RN A was 

isolated direcQy from tissue lysates using other RNA isolation kits, eg., the POLY(A)PURE niRNA 
purification kit (Ainbion, Austin TX). 
5 In some cases, Stratagene was provided with RNA aiKl constructed the corresponding cDNA 

libraries. Othenvise, cDNA was syn&esized and cDNAHbraries were constructed with 
vector system (Stratagene) or SUPERSCRIPT plasmid syston OLife Technologies), using the 
recommended procedures or similar metihods known in the art, (See, e.g., Ausubel, 1997, supra, units 
5.1-6.6.) ReversetranscriptiQnwasinitiatedusingoligod(T) or random primers. Synthetic 

10 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
j^^jropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-sdected (300-1000 
bp) using SEPHACRYL SIOOO, SEPHAROSE CX2B, cr SEPHAROSE CL4B column 
cbroDiatography (AmersbamPbarmadaBiotedi)crprq»arativeagaro^ cDNAs 
were ligated into compatible restriction enzyme sites of thepolylinker of a suitable plasmid, e.g., 

15 PBLUESCRIPT plasmid (Stratagene), PSPORTl plasmid (Life Technologies), PCDNA2. 1 plasmid 
Otavilrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene). or pINCY (liicyte Genomics, Palo Alto 
CA), or derivatives tli^eof. Recombinant plasmids were transf(Mined into competent E. coli ceills 
including XLl-Blue, XLl-BlueMRF, or SOLR from Stratagene or DH5a, DHIOB, or ElectroMAX 
DHIOB from Life Technologies. 

20 n. Isolation of cDNA Clones 

Plasmids obtained as described in Bxmsple I were recovered from host ceQs by in vivo excision 
using the UNEAP vector syston (Stratag«ie) or by ceO lysis. Plasmids were purified using at least 
one of the following: a Magic or WIZARD NDnipreps DNA purification system (Prom^a); an AGTC 
Miniprqp purification kit (Edge Biosystons, Gaithearsburg MD); and QLWELL 8 Plasmid, QIAWELL 

4 

25 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systrais or the R.E. AL. PREP 96 plasmid 

purification kit from QIAGEN. Following precipitation, plasmids were resuspended hi 0. 1 ml of 

distilled water and stored, with or without lypphilization, at 4*'C. 

Altemativdy , plasmid DNA was an5)lified firom host cell lysates using direct link PCR in a 

high-througlqjut format (Rao, y.B. (1994) Anal. Biochan. 216:1-14). Host cdl lysis and thermal 
30 cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 

wdl plates, and the concentration of anq)lified plasmid DNA was quantified fluorometrically using 

PICOGREEN dye (Molecular Probes, Eugoie OR) and a FLUOROSKAN E fluoresceaoce scanna 

(Labsyst^ns Oy, Helsinki, Finland). 

m. Sequencing and Analysis 
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Incyte cDNA recov^ed in plasmids as described in Example II were sequenced as follows. 
Sequeocing reactions were processed using standard methods or bigh-throughput instrumentation 
such as the ABI CATALYST 800 (AppUed Biosystems) thermal cyder or the PTC-200 thermal cyder 
(MI Research) in conjimction with the HYDRA miaodispenser (Robbins Scientific) or the 
5 MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prq)ared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing Idts such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (^plied Biosystems), 
Electrpphor^c sq)aration of cDNA sequendng reactions and detection of labded polynudeotides were 
carried out using the MEGABACE 1000 DNA sequ^idng systan (Molecular Dynamics); ttie ABI 
10 PRISM 373 or 377 sequendng system (Applied Biosystems) in conjunction with standard ABI 
protoccdsaiul base calling software; or otiier sequence analysis systei^ Readmg 
frames within the cDNA sequences were identified usmg standard methods (reviewed hi Ausubd, 1997, 
supra, unit 7.7). Some of the cDNA sequences were sdected for extension using the techniques 
disdosed in Example Vni. 

15 The polynucleotide sequoices darived from Incyte cDNAs ware validated by ranoving vector, 

linloea:, and poly(A) sequences and by masking ambiguous bases, using algorithms and programs based 
oa BLAST, dynamic programming, and dmudeotide nearest ndghbor analysis. The Incyte cDNA 
sequences or translations thereof were then queried against a sdection of public databases such as tiie 
GenBank primate, rodeait, mammalian, vertebrate, and eukaryote databases, and BLOCKS, PRINTS, 

20 DOMO, PRODOM, and hidden Markov modd (HMM)-based protein family databases such as PFAM. 
(HMM is a probabilistic approach which analyzes consensus primary structures of gene families. 
See. for example, Eddy. S.R. (1996) Curr. Opin. Struct Biol. 6:361-365.) Hie queries were 
performed usmg programs based <mi BLAST. FASTA, BLIMPS, and HMMER. The Incyte cDNA 
sequences were assembled to produce ftiU length polynucleotide sequoices. Alternativdy, GenBank 

25 cDNAs, GaiBank ESTs, stitched sequences, stretched sequences, or Genscan-predicted coding 
sequences (see Exan?)les IV and V) wffe used to extend Incyte cDNA assonblages to fiill lengtii. 
Assembly was performed ushig programs based on Phred, Phrap, and Consed, and cDNA assemblages 
were screened for open readhig frames using programs based on GaieMark, BLAST, and FASTA 
The full l^gtti polynucleotide sequoices were translated to derive tiie corresponding full lengtii 

30 pdypq)tide sequoices. Alternativdy, a polypqptide of the invention may begin at any of the mi^onine 
residues of the full lengdi translated polypeptide. Full length polypq)tide sequraces were subsequenfly 
analyzed by querying against databases sudi as the GenBank protein databases (geape^tX SwissProt, 
BLOCKS. PRINTS, DOMO, PRODOM, Prosite, and hidden Martov modd (HMM)-based protdn 
family databases such as PFAM. Full length polynucleotide sequences are also analyzed using 
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MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco CA) and 
LASERGENE software (DNASTAR). Polynucleotide and pQlypq)tide sequence alignments are 
generated using default parameters specified by the CLUSTAL algorithm as incorporated into the 
MEGALIGN multisequaice alignmi^ program (DNASTAR), which also calculates the perc^ 
5 identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and full length sequm:es and provides ^licable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algcdthms used, the second 
colunm provides bri^ descriptions Ihaeof, the third column presents appropriate references, all of 

10 which are incorporated by rrference herein in Iheir aitirety, and the fourth column presorts, where 
applicable, the scores, probability values, and other parameters used to evaluate the strragth of a match 
betwe^ two sequoices (the higher the score or the lower the probability valu^ the greater the identity 
betwe^ two sequoices). 

The programs described above for the assembly and analysis of full loigth polynucleotide and 

15 polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:7-12. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies are described in Table 4, column 4. 
IV. Identification and Editing of Coding Sequences firom Genomic DNA 

Putative neurotransmitter transporters wffe initially identified by running the Geanscan gene 

20 idaitification program against public genomic sequence databases (e.g„ gbpri and gbhtg). Genscan is a 
graffl-al-purpose gene identification program which analyzes genomic DNA sequences from a variety of 
organisms (See Burge, C. and S. Karhn (1997) J. Md. Biol. 268:78-94, and Surge, C. and S. Karlin 
(1998) Curr. Qpia Struct. BioL 8:346-354). The program concatenates predicted exons to form an 
assenibledcDNAsequeQceextendhig from a methionine to a stop codoa Theoutputof Genscan is a 

25 FASTA database of polynucleotide and polypq)tide sequences. The mayimnTT) range of sequence for 
Genscan to analyze at once was set to 30 kb. To determine which of these Genscan predicted cDNA 
sequences encode neurotransmitter transporters, the aicoded po^ 

against PFAM modds for neurotransmitter transporters,- Potential neurotransmitter franqporters were 
also identified by honudogy to Incyte cDNA sequoices that had been annotated as neurotransmitter 
30 transporters. These selected Genscan-predicted sequences were then con?)ared by BLAST analysis to 
the ger5)q)t and gbpri public databases. Where necessary, the Geoscan-predicted sequences were then 
edited by con^arison to the top BLAST hit from genpept to correct ecrors in the sequence predicted by 
Gmscan, such as extra or omitted exons. BLAST analysis was also used to find any Incyte cDNA or 
public cDNA covorage of the Gaiscan-predicted sequooces, thus providing evidence for transcription. 



70 



wo 01/90148 



PCT/USO 1/16283 



When Lacyte cDNA coverage was available, this information was used to correct or confinn the 
Genscan predicted sequence. Full length polynucleotide sequences were obtained by ass^nbling 
Genscan-predicted coding sequaices with Incyte cDNA sequences and/or public cDNA sequences using 
the assenibly process described in Example m. Alternatively, full l^igth polynucleotide sequences were 
5 derived entirdy from edited or unedited Genscan-predicted coding sequences. 
V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Goiscan gene id^itification 
program described in Example IV. Partial cDNAs assembled as described in Exan5)le in were mapped 

10 to genomic DNA and parsed mto clusters containing rdated cDNAs and Genscan exon predictions from 
one or more genomic sequences. Eachcluster was analyzed using an algorithm based on graph theory 
and dynamic programmuig to hitegrate cDNA and genomic information, genfl:athig possible splice 
variants that were subsequently confirmed, edited, or extended to create a full length sequ^ice. 
S equoice hiter vals in which the entire length of the int^val was present on more than one s equaice hi 

15 the clusta: were identified, and hitorvals thus identified were considered to be equivalent by transitivity. 
For esanqde, if an interval was present on a cDNA and two gaaomic sequences, then aU three mtervals 
were considaed to be equivalent This process allows unrelated but consecutive genomic sequences to 
be brought together, bridged by cDNA sequooce. Intervals thus identified were then "stitdhied" together 
by the stitdiing algorithm in the order that Ihey appear along theJr parent sequences to generate the 

20 longest possible sequeoce, as weJl as sequence variants. linkages between intervals which proceed 
along one type of parent sequence (cDN A to cDNA (k gaiomic sequoice to genomic sequence) were 
given preforence over linkages vMch change parent type (cDNA to genomic sequence). The resultant 
stitched sequences were translated and compared by BLAST analysis to tiie geapept and gbpri public 
databases. Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit 

25 fromgoipept Sequences were ftulher extended with additional cDNA sequences, 
genomic DNA wti^ necessary. 
^^Stretched" Seqnences 

Partial DNA sequences were artmied to full length witti an jagorithm based on BLAST 
aiudysis. First, partial cDNAs assembled as described in Example m were queried against public 

30 databases such as the GaiBank prhnate, rodent, mammalian, vertebrate, and eukaryote databases using 
the BLAST program. The nearest GaiBank protein homolog was then compared by BLAST analysis 
to dther Incyte cDNA sequences or G^iScan exon predicted sequences described in Exan?)le IV. A 
chuneric protdn was generated by usmg the resultant higjh-scoring segment pairs (HSPs) to map the 
translated sequences onto the G^ank protein homolog. Insertions or ddetions may occur in the 
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chimmc protdn with respect to the anginal GenBank protein homolog. The GenBank protdn homolog, 
the chuneric protdn, or both were used as probes to seardi for homologous gmcnnic sequences from the 
public human genome databases. Partial DNA sequences were therefore "str^ched" or extended by the 
addition of liomcflogous genomic sequences. The resultant stretched sequences were examined to 
5 detennuie whether it contained a complete gene. 

VI. Chromosomal Mapping of NTT Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:7-12 were compared with sequences 
Jfrom the Inqyte LIFESEQ database and public domain databases using BLAST and otber 
inqdementations of the Smith- Wateman algorithm. Sequences from these databases that matched 

10 SEQ ID NO:7-12 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Cento- (SHGC), Whitehead Institute for 
Gmome Research (WIGR), and G6n6thon were used to det^rmhie if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 

15 of all sequences of that clust^, indudhig its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the traninus of the chromosome' s p- 
ann. (Hie centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On av^age, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 

20 humans, although this can vary widely due to hot and cold spots of recombination.) The cM 

distances are based on genetic markers mapped by G6n6thon which provide boundaries for radiation 
hybrid markers whose sequences were included in each of the clusters. Human genome maps and 
otito resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(httpy/www Jicbi.nlm.nilLgov/genemap/), can be employed to determine if previously identified 

25 disease genes map within or in proximity to the intervals indicated above. 
Vn. Analysis of Polynucleotide Expression 

North^n analysis is a laboratory technique used to detect the presence of a transcript of a gene 
and involves the hybridization of a labded nucleotide sequaice to a membrane on which RNAs from a 
particular ceU type or tissue have been bound (See, e.g., Samhrook, supra, ch. 7; Ausubd (1995) 

30 supra , ch, 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
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search can be modified to deternune whetber any particular matcb is categorized as exact or simflar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x PCTceot Identity 
5 5 X minimum {lengtii(Seq. 1), lengtii(Seq. 2)} 

The product score takes into account both the degree of shnilarity betweai two sequences and the lengtii 
of the sequ^ce match. Theproduct score is a nornialized value between 0 and 100» and is calculated 
as follows: tlie BLAST score is multiplied by the pearcaat nucleotide idootity and the product is divided 

10 by (S times the lengtii of the shoter of the two sequences). The BLAST score is calculated by 

assigning a score of +5 for every base tiiat matches in a high-scoring s^ment pair (HSP), and -4 for 
every mismatch. Twosequencesmay share more than one HSP (sq)arated by gaps). If there is more 
than one HSP, thiMitiiB pair with the Mgbest BLAST score is nsed to calculate The 
product score rq)resents a balance between firactional overlap and quality in a BLAST fliigninfint, For 

15 example^a product score of 100 is produced only for 1(K)% identity ova: tiie entire lengtii of tiiesho^ 
of tiie two sequences bdng compared A product score of 70 is produced either by 100% identity and 
70% overlap at one aid, or by 88% identity and 100% overlap at the ottier. A product score of 50 is 
produced either by 100% identity and 50% overlap at one aid, or 79% identity and 100% overlap. 

Altanativeiy, polynucleotide sequences aicoding NTT are analyzed with respect to the tissue 

20 sources fi:om which they were derived. For exan?)le, some fufl lengtii sequences are assenibled, at least 
in part, mih overlapping Incyte cDNA sequences (see Example HI). Each cDNA sequence is derived 
firom a cDNA library constructed fi:om a human tissue. Each human tissue is classified into one of the 
foUowing organ/tissue categories: cardiovascular system; connective tissue; digestive system; 
esmbryonic structures; endocrine system; exocrine glands; genitalia, female genitalia, male; germ cdls; 

25 honic and immune system; liver; musculoskeletal system; nervous systan; pancreas; respiratory 

system; saise organs; sldn; stomatognatinc system; undassified/mixed; or urinary tract Hie nnmbear of 
libraries in each cat^ory is counted and divided by the total number of libraries across all categories. 
Similarly, each human tissue is classified into one of the foILowhig disease/condition cathodes: cancer, 
cdl line* developmental, inflammation, nairdogical, trauma, cardiovascular, pooled, and ottier, and the 

30 number of libraries in each cat^ory is counted and divided by the total number of libraries across an 
cat^ories. The resulting percentages reflect the tissue- and disease-specific expr^^^ 
©ttcoding NTT. cDNA sequences and cDNA library/tissue information are found in the LIFESEQ 
GOLD database (Incyte Genomics, Palo Alto CA). 
Vm. Extension of NTT Encoding Polynucleotides 
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Full I^igth polynucleotide sequences also produced by extension of an appropriate 
fragment of the full length molecule using oligonucLeotide primm designed from this fragmoot. One 
primer was synthesized to initiate 5 'extension of the known fragment, and the othor primer was 
synthesized to initiate 3' extension of the known fragm^t. The initial prim^ were designed using 
5 OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in laigth, to have a GC cont^ of about 50% or more, and to anneal to the targ^ sequence 
at tenQ)eratures of about 68 ''C to about 72'*C. Any str^di of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Sdected hmnancDNA libraries were used to ext^ the sequence. If more than one extension 

10 was necessary or desired, additional or nested s^ of primers were designed. 

Higlifiddity ampMfication was obtained by PGR using methods wdl known in the P£R 
was porfcamed in 96-wdl plates ushig the PTC-200 thermal cycler (MJ Research, Inc.). Hie reaction 
mix contained DNA template^ 200 nmol of each primer, reaction buffer containing Mg^, (NH4)2S04, 
and 2-mercapto^hanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 

15 (Life Technologies), and Pfti DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94*C, 3 min; Step 2; 94**C, 15 sec; Step 3: 60^C. 1 min; Step 4: 68 X, 
2 min; Step 5: Steps 2, 3. and 4 repeated 20 times; Step 6: 68 'C, 5 mm; Step 7: storage at 4''C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94*C, 3 mii^ Stqp 2: 
94**C, 15 sec; Step 3: 57*C, 1 mhi; Step 4: 68'C, 2 mim Step 5: Steps 2, 3, and 4 repeated 20 times; 

20 Stq) 6: 68 'C, 5 min; Step 7: staage at 4^C. 

Hie concentration of DNA in each wdl was d^ermined by dispensing 100 |il PICOGREEN 
quantitationreagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugeme OR) dissolved in IX TE 
and 0.5 Hi of undiluted PGR product into each wdl of an q)aque fiuorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 

25 (Labsystems Oy, Hdsinki, FinlancQ to measure the fhioresc^ce of the sample and to quantify the 

concentration of DNA. A5 ^^Ito 10 //l aliquot ofthe reaction mixture was analyzed by dectrophoresis 
onal % agarose gd to detenninewMch reactions were successful in extending the sequence. 

The extended nudeotides were desalted and concentrated, transferred to 384-weail plates, 
digested with CviJI diolera virus endonuclease (Molecular Biology Research, Madison WI), and 

30 sonicated or sheared prior to rdigation into pUC 18 vector (Amersham Pharmacia Biotedi). For 

shotgun sequ^icing, the digested nudeotides were sq)arated on low concentration (0.6 to 0.8%) agarose 
gds, fragments were excised, and agar digested with Agar ACE (Promega). Extended dones were 
rdigated using T4 ligase (New Engjand Biolabs, Beverly MA) into pUC 18 vector (Amarsham 
Pharmada Biotedi), treated with Pfii DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
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and transfected into coiq>eteiit E. coli cdls. Transfonned cells were selected on antibiotic-cQntaiBing 
media, and individual colonies were picked and cultured overnight at 37'C in 384-wdl plates in LB/2x 
carb liquid media. 

TTie cells were ly sed, and DN A was amplified by PC R using Taq DNA polymerase ( Amersham 



94*^0, 3 min; Step 2: 94**C, 15 sec; Step 3: 60^*0, 1 min; Step 4: 72*C, 2 min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72**C, 5 min; Step 7: storage at 4*'C. DNA was quantified by PICOGREEN 
reagMit (Molecular Probes) as described abova Samples with low DNA recoveries were reanq)lified 



using the same conditions as desoibed above. Samples were diluted with 20% dhnethysulfoxide (1 :2, 
10 v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT lot (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 



In like manner, full leaigtii polynucleotide sequences are verified using the above procedure or 
are used to obtain 5 ' r^ulatory sequences using the above procedure along with oligonucleotides 

15 designed for such ejctension, and an appropriate genomic library. 
JX. Labeling and Use of Individual Hybridization Probes 

Hybridization probes daived fi:om SEQ ID NO:7-12 are enq)loyed to screen cDNAs, genomic 
DNAs, or mRNAs. Although the labeUi^ of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same jarocedure is used with larger nucleotide fi-agmraits. 

20 Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 fiCi of [y-^] adenosine 
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10^ counts per 

25 minute of tiie labeled probe is used in a typical maribrane-based hybridization analysis of human 
genomic DNA digested with one of tiie following aidonudeases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or 
Pvun (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gd and transfeaxed to nylon 
manbranes (Nyhran Plus, Sdildcha: & Sdmdl, Durham NH). Hybridization is carried out for 16 
30 hours at 40 To remove nonspecific signals, blots are sequoottially washed at room temperature 
undo: conditions of up to, for exan5)le, 0.1 x saline sodium dtrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
con^ared. 

X. Microarrays 



5 Pharmada Biotech) and Pfu DNA polymerase (Stratagene) with tlie following parameters: Stqp 1 : 




sequmdng ready reaction Jtit (Applied Biosystems). 



75 



wo 01/90148 



PCT/USOl/16283 



The linkage or synthesis of array dements upon a microarray can be acliieved utilizing 
photolilliography. piezodectric printing (ink-jet printing, See» e,g., Baldesdiwdler, supra .), mechanical 
microspotting tecimologies, and derivatives thereof. The substrate in each of tilie aforaneationed 
tedmologies should be umfam and solid with a non-porous surface (Sdiaia (1999), supraV Suggested 
5 substrates indude silicon, silica, glass slides, glass diips, and silicon wafers. Altemativdy, a procedure 
analogous to a dot or slot blot may alsobeusedtoarr ange and link dements to the surface of a 
substrate using Hiermal, UV, diomical, or mechanical bonding procedures. A typical array may be 
produced usmg available mrthods and madiines weJl known to those of ordinary skill in the art and may 
contain any appropriate number of demfitots. (See, e.g., Schena, M d al. (1995) Sdence 270:467-470; 
10 Shalon. D. d al. (1996) Geaiome Res. 6:639-645; Marshall, A and J. Hodgson (1998) Nat Biotechnol. 
16:27-31.) 

Rin laigth cDNAs, Expressed Sequence Tags OESTs), or fragments or oligomers lhaeof may 
comprise the dOTients of the microarray. Fragments or oligomers suitable for hybridizatian can be 
sdected using software wdl known in the art sudi as LASERGENE software (DNASTAR). The array 

15 demons are hybridized with polynucleotides in a biological san5)le. The polynudeotides in the 
bidogical sain)le are conjugated to a fluoresceat labd or otho: molecular tag far ease of detectioa 
After hybridization, nonhybridized nucleotides from the biological smple are removed, and a 
fluaresceoce scann^ is used to ddect hybridization at each array demoit. Alternativdy, laser 
desorbtion and mass spectronaetry may be used for detection of hybridizatioa Tbe degree of 

20 conqdementarity and therdative abundance of each polynudeotide v\*idi hybridizes to an dement on 
the miaoarray may be assessed. In one embodiment, microarray preparation and usage is described in 
d^lbdow. 

Tissue or Cdl Sample Preparation 

Total RNA is isolated from tissue samples ushig the guanidinium thiocyanate method and 

25 poly(A)* RNA is purified using the oligo-(dT)cdlulose method. Each poly(A)* RNA sample is 
reverse transcribed using MMLV revoise-transcriptase, 0.05 pg/pl oligo-(dT) primer (21mer), IX firat 
strand buCfer, 0.03 units/pl RNase inhibitor, 500 \M dATP, 500 \M dGTP, 500 jiM dTTP, 40 iM 
dCTP, 40 mM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Phatmacia Biotech). The reverse 
transcription reaction is p^ormed in a 25 ml volume containing 200 ng poly(A)* RNA with 

30 GEMBRIGHT kits (Incyte). Specific control poly(A)* RNAs are synthesized by hi vitro transcription 
from non-coding yeast genomic DNA Aft^ incubation at 37" C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 nil of 0.5M sodium hydroxide and 
mcubated for 20 minutes at 85** C to the stop the reaction and degrade the RNA Samples are purified 
using two successive CHROMA SPIN 30 gd filtration spin columns (CLONTECH Laboratories, Inc. 
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(CLONTECH), Palo Alto CA) and after combinmg, toth reaction saujples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml). 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
thea dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 pi 5X SSCyO.2% SDS. 
5 Mtcroarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cdls containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complraimtary to the vector sequences flanking the cDNA insert Array dements are 
anq)lified in thirty cycles of PCR jBx>m an initial quantity of 1 -2 ng to a final quantity greater than 5 
10 Mg. Anq)lified array elements are then purified using SEPHACRY1>«)0 (Ameisham Pharmacia 
Biotech), 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Coming) are cleaned by ultrasound in 0. 1% SDS and acetone, with ejrtensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
15 Scientific Products Corporation (VWR), West Chests PA), washed ejrtensively in distiUed water, and 
coated witti 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10**C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated haein by reference. 1 pi of the array element DNA, at an average 
20 concentration of 100 ng/pl, is loaded mto the open capillary printing element by a high-speed robotic 
^paratus. The apparatus then dq)osits about 5 nl of array dement sample per slide. 

Microarrays are UV-crosslinked using aSTRATAUNKER UV-crosslinker (Stratagene). 
Microarrays are washed at room tempa:ature once in 0.2% SDS and three times in distiUed water. 
Non-specific binding sites are blocked by incubation of miaoarrays in 0.2% casdn in phosphate 
25 buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in 
0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 pi of sample mixture consisting of 0.2 jig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

30 mixture is heated to 65 ° C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1 .8 cm^ covarslip. The arrays are transferred to a waterproof chamber having a cavity just sUghtly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 pi of 5X SSC in a comer of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60° C. The arrays are washed for 10 min at 45** C in a first wash buffer (IX SSC, 

35 0.1% SDS), tbree times for 10 minutes each at 45''C m a second washbuffer (O.IX SSC), and dried. 
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Detection 

Reporter-labeled hybridizatioii complexes are detected with a mioroscope equipped with an 
Innova 70 mixed gas 10 W las^ (Coherent, Ina, Santa Qara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation iBser light is 
5 focused on the array using a 20X miaoscope objective (Nikon, Inc., Mdville NY). The slide 
containing the array is placed on a computer-controUed X-Y stage on the miaoscope and raster- 
scanned past the objective. The 1.8 cm x 1 .8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorpphores sequentially. 

10 Emitted light is split, based on wavelength, into two photomultiplia* tube detertors (PMT R1477, 
Hamamatsu Photonics Systems, Biidgewat^ NJ) corresponding to the two fluorophores. Appropriate 
filt^ positioned between the array and the photomultiplier tubes are used to filter the signals. The 
anission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 

15 although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity gemmated by a 
cDNA control species added to the sample mixture at a Imown concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1 :100,000. When two samples 

20 from different sources (e.g., representing test and control cells), each labeled with a different 

fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentiaUy 
e3q)ressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
fluorophores and adding identical amoxmts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 

25 (A/D) conva^ion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 

30 emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
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XI. Complementary Polynucleotides 

SequQQces complementary to the NTT-encoding sequaices, or any parts thereof, are xised to 
detect, decrease, or inhibit expression of naturally occurring NTT. Although use of oligonucleotides 
comprisuig from about 15 to 30 base pairs is described, essentially the same procedure is used with 
5 smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and ttie codling sequence of NTT. To mMbit transcription, a 
compl^nentary oligonucleotide is designed from the most unique 5 ' sequence and used to prevent 
promoter binding to the codmg sequence. To inhibit translation, a conq)lementary oligonucleotide is 
designed to prevent ribosomal binding to the NTT-encoding transcsipt. 

10 Xn. Expression of NTT 

Expression and purification of NTT is achieved using bacterial or virus-based expression 
systems. For e35)ression of NTT in bacteria, cDNA is subdoned into an appropriate vector contaimng 
an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA transcriptioa 
Exanoples of such promotes include, but are not limited to, the trp-lac (toe) hybrid promoter and the 

15 T5 or T7 bacteriophage prcmoter in ooiqunction with the lac operator r^ulatory elemenL 

Recombmant vectors are transformed hito suitable bacterial hosts, e.g., BL21(DE3). Antibiotic 
resistant bacteria express NTT upon induction with iscprqpyl b^-D-thiogalactopyranoside (DPTG). 
Expression of NTT in eukaryotic ceils is achieved by infiectmg insect or mflmm aiiflTi cdl lines with 
recombinant Autogaphic a califnmi^;^ nuclear polyhedrosis virus (AcMNPV), commonly known as 

20 baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDNA encoding NTT by 
either homologous recombination or bacterial-mediated transposition involving transfer plasmid 
intermediates. Viral infecttvity is maintained and the strong polyhedrin promoter drives high levels of 
cDNA transcriptioa Recombmant baculovirus is used to infect Spodoptera frueiperda (Sf9^ insect 
cells in most cases, or human hq)atocytes, in some cases. Infection of the latter requires additional 

25 genetic modificatiosos to baculovirus. (See Engdhard, E.K et al. (1994) Proc. Nafl. Acad Sd. USA 
91:3224-3227; Sandig, V. et al. (1996) Hum. G^Ther. 7:1937-1945.) 

In most expression systans, NTT is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide q)itope tag, such as FLAG or 6-His, permitting rapid, single-stqp, 
affinity-based purification of recombinant fusion protdn from crude ceil lysates. GST, a 26-kilodalton 

30 enzyme from Schistosnma iaponicunL ^bles the purification of frision protdns on immobilized 
glutathione under conditions that maintain protdn activity and antigenicity (Amersham Pharmacia 
Biotedi). Following purification, tiie GST moiety can be proteolytically cleaved from NTT at 
specifically engmeered sites. FLAG, an 8-amino acidpqptide, enables imraunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
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His, a stretdi of six consecutive histidine residues, enables purification on metal-cbeiate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10andl6). Purified NTT obtained by 1]iesem^lK)ds can be used directly 
Examples XVI and XVII wb^e applicable. 
5 Xni. Functional Assays 

NTT function is assessed by expressing the sequences encoding NTT at physiologically 
devated levds in mammalian cdl culture systems. cDNA is subdoned into a mammalian egression 
vector containing a strong promoter Ilut drives Mghlevds of cDN A esip^ Vectors of cboice 
include PCMV SPORT OJfe Tedmologies) and PCR3.1 (Invltrogeai, Carlsbad CA), both of vMctx 
10 contain the cytomegalovirus promoter. 5-10 //g of recombinant vector are transiently transfected into a 
human cdl line^ for example, an ^othelial or h^natqxsietic cdl line, using eitfa^ liposome 
formulations or dectrq)oratioa 1-2 /^g of an additional plasmid containing sequences ^icoding a 
marker protdn are co-transfected. E?^essionof a marker protdn provides a means to distinguish 
transfected cdls fi*om nontransf ected cdls and is a rdiable predictor of cDNA expression from the 
15 recombinant vector. Marba* protdns of choice uidnde» e.g.. Green Fluorescent Protdn (GFP; 

Clontech), CD64, or a CD64-GFP fusion protdn. Flow cytometry (FCM), an automated, laser optics- 
based tedmique, is used to idaitify transfected cells expressing GFP or CD64-GFP and to evaluate the 
qxsptotic state of the cdls and otiiercdlular properties. FCM detects and quantifies the uptalce of 
fluorescent molecifles that diagnose ev^ preceding or cdnddentwi These ev^ include 

20 dianges in nudear DNA content as measured by staining of DNA with propidium iodide; changes in 
cdl size and granularity as measured by forward ligjit scatter and 90 degree side ligjit scatta:; down- 
regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
eqnression of cdl surface and intracdlular protdns as measured by reactivity witii specific antibodies; 
and alterations in plasma membrane composition as measured by thebhiding of fluorescdn-conjugated 
25 AnnexinV protdn to the cdl surfiice. Metbods in flow cytometry are discussed in Qrmerod.M.G. 
(1994) FlowCvtometrv . Oxford, New York NY. 



The influence of NTT on gene repression can be assessed using highly purified populations of 
cdls transfected with sequences encoding NTT and dflier CD64 or CD64-GFP. CD64 and CD64-GFP 
are expressed on the surface of transfected cells and bind to conserved regions of human 
30 immunoglobulhi G (IgG). Transfected ceDs are effideaifly sq)arated JBrom nontransfected cdls usmg 
magnetic beads coated wifli dther human IgG or antibody against CD64 (DYNAL, Lake Success NY), 
mRNA can be purified firomtiieceJlsushig methods wdl known by those of skin in ^ Expression 
of mRNA encoding NTT and other gaies of interest can be analyzed by northern analysis or microarray 
tedmiques. 
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XIV. Prodiictloii of NTT Specific Antibodies 

NTT substantially purified using polyaaylamide gd electrophoresis (PAGE; see, eg., 
Harrington, M.G. (1990) Methods EnzymoL 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 
5 Altanativeiy, the NTT amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to detennine r^ons of high immunogenidty, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means Imown to those of sldn in tte Mrthodsfor 
selection of appr(?)riate qMtq)es, such as those near the C-temmins or in hydrophilic regions are weSl 
desaibed in the art. (See, e.g., Ausubd, 1995, supra> ch. 11.) 

10 Typically, oligopeptides of about 15 residues in length are synthesized using an ABI431A 

pq>tlde synthesizer (Applied Biosystems) using FMOC chanistry and coupled to KLH (Sigma- 
Aldrich, St Louis MO) by reaction with N-maldnudobenzoyl-N-hydroxysuccimmide ester (MBS) to 
increase immunogenicity. (See, ag., Ausubd, 1995, Rabbits are immunized wiHi the 

oUgopeptide-KLH complex in conoplete Freund's adjuvant Resulthig antisera are tested for antipq)tide 

15 and anti-NTT activity by, for ecample, bhiding thepqptide or NTT to a substrate, blocking with 1% 
BSA, reacting with rabbit antis^a, washing, and reacting with radio-iodmated goat anti-rabbit IgG, 

XV. Purification of Naturally Occurring NTT Using Specific Antibodies 
Naturally occurring or recombinant NTT is substantially purified by immunoafSni^ 

dbromatography using antibodies specific for NTT. An immunoaffinity ccflumn is constructed by 
20 covalently coupling anti-NTT antibody to an activated diromatographic resm, such as CNBr-activated 

SEPHAROSE (Amorsham Pharmacia Biotech). Afta* the coupling, the resin is blocked and washed 

according to thue manufacturer's instructions. 

Media containing NTT are passed over the immunoaffinity column, and the cohmin is washed 

undo- conditions that allow the prefa-ential absorbance of NTT (e.g., high ionic strength buffers in the 
25 presence of detergent). The column is eiluted undo: conditions that disrupt antibody/NTT bindhig (e.g., 

a buffer of pH 2 to pH 3, or a high concenfration of a chaotrqpe, such as urea or thiocyanate ion), and 

NTT is collected. 

XVI. Identification of Molecules Which Interact with NTT 

NTT, or biologically active fragments thereof, are labded with ^ Bolton-Hunter reagent. 
30 (See, e.g., Bolton AE. and W.M. Hunta: (1973) Biodbem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wdls of a multi-weJl plate are incubated with the labeled NTT, washed, and 
any weills with labded NTT con?}lex are assayed. Data obtamed using different conceotrations of NTT 
are used to calculate values for the nunabor, affinity, and association of NTT with the candidate 
molecules. 
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Alternatively, molecules interactiiig with NTT are analyzed using the yeast twohyhrid 
system as described in Fields, S. and O. Song (1989) Nature 340:245-246, or using commerdany 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Qontech). 

NTT may also be used in the PATHCALUNG jnrocess (CuraGen Ccop., New Haven CT) 
5 which en^loys the yeast two-hybrid systaoa in a higbrthrougtq>ut manner to detennine all interactions 
b^eoi the protdns encoded by two large libraries of g^ies (Nandabalan, K. et aL (2000) U.S. Patent 
No. 6,057,101). 

XVn. Demonstration of NTT Activity 

Measuremoits of NTT activity include tracer fluxes and dlectrqphysiQLogical approaches. 

10 Tracer fluxes are danoostrated by measuring uptake of labeled substrates into Xmoims laevis oocytes. 
Ooqies at stages V and VI are injected with NTT mRNA ( 1 0 ng per oocyte) and hicubated for three 
days at 18**C in 0R2 medium (82.5mM NaQ, 2.5 mM KQ, ImM CaCla, ImM MgCl^, ImM 
Na2HP04. 5 mM Hepes, 3.8 mM NaOH , 50|ig/ml gaitamydn, pH 7.8) to allow expression of NTT 
protda Oocytes are tiien transferred to standard uptake medram (lOQmM NaQ, 2 mM KQ. ImM 

15 CaOj, ImM MgCl^, 10 mM Hepes/Tris pH 7.5). Uptake of various neurotransmitters is initiated by 
adding a ^H substrate to the oocytes. After incubating for 30 minutes, uptake is terminated by washing 
tbe oocytes three tunes in Na*-fi:ee medium, measuring the incorporated ^H, an^ 
CQntrds. NTT activity is prqKMtional to the level of internalized ^H substrate. 

Altemativdy, NTT activity can be demonstrated using an dectrqphysiological assay for ion 

20 conductance. Capped >riT mRNA transcribed with T7 polymerase is iigectedM^ 

V Xaiopus oocytes, similar to previously described. Two to seven days later, transport is measured by 
two-electrode voltage damp recordiAg. Two-electrode voltage damp recordings are performed at a 
hdding poteaitial of 50 mV. The data are filtered at 10 Hz and recorded with MacLab digital-to-analog 
converter and software fca: data acquisition and analysis (AD Ihstrumaits, Castle Hm, Australia). To 

25 study the dependaice of NTT on ejcternal ions, sodium can be rq)laced by choline or 

N-mefhyl-D-glucamine and chloride by gluconate, NO3, or SO4 (Kavanaugh, M.P. et al. (1992) J. Biol. 
Chem. 267:22007-22009). 

In the alternative, choline transporter activity or dioline-transporter-like CTLl protein 
activity of NTT is determined by measuring choline uptake by yeast transformed with expression 

30 vectors harboring polynucleotides encoding NTT. The assay is performed hi nitrogen-free medium at 
30*'C for 10 or 30 min in the presence of 25 nM [^H]choline. The cells ate then filtered, and washed. 
The amount of [^H]choline present in the cefls is proportional to the activity of NTT in the cells 
(O'Regan, S. supra) . 
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Various modificalions and vaiiatiom of the described methods and systems of the invention will 
be apparent to those skffled in the art without dqpartmg from the scope and spirit of the inv^on. 
Although the inveaition has been described in connection with catain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
5 bdeed, various modifications of the described modes feu: carrying out the invention wliich are obvious 
to those skilled in molecular biology or rdated fidds are intended to be witiiin the scope of the f oillowing 
dahns. 
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GenBank Homolog 


[Rattus norvegicus] GABA transporter 

(Borden, L. A. et al. (1992) J. Biol. Chem. 267, 21098- 
21104) 


[Rattus sp.] sodixim-dependent neurotransmitter 
transporter 

(Uhl,G.R. et al. (1992) Brain Res. Mol. Brain Res. 16 
(3-4), 353-359) 


[Homo sapiens] vesicular monoamine transporter VMATl 
(Erickson, J.D. et al. (1996) Proc. Natl. Acad. Sci. 
U.S.A. 93 (10), 5166-5171) 


[Homo sapiens] taurine transporter | 
(Miyamoto, Y. et al. (1996) Curr. Eye Res. 15 (3), 345- 
349) 


[Homo sapiens] dopamine transporter 

(Vandenbergh,D. J. et al. (1992) Brain Res. Mol. Brain 
Res. 15 (1-2), 161-166) 


[Rattus sp.] neurotransmitter transporter rB21a 
(Smith, K.E. et al. (1995) FEES Lett. 357 (1), 86-92) 
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55046086H1 


6914742F6 
(PITUDIROl) 


2289873R6 
(BRAINONOl) 


4181117H1 
(SINITUT03) 


rH 
VO 

a\ 
o 

CN 
00 
t3) 


2896553F6 
(KIDNTUT14) 


7117217H1 
(BRAHNOEOl) 


2419627R6 
(SCORNON02) 


2419627T6 
(SCORNON02) 


Selected 
Fragment ( s ) 


1383-1520 


1 1383-1520 


1383-1520 


o 

CN 

m 

H 

) 

m 

00 

n 

T-H 


1383-1520 


1 1383-1520 


11383-1520 1 


1 1383-1520 1 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


vo 

H 

00 
CN 
1 

VD 

in 

LO 
CN a\ 

O 1 

in 00 

rH ^ 


1-50, 2756-2816, 
498-1495 


1-50, 2756-2816, 
498-1495 


vo 

rH 
00 
CN 
1 

VO 

in 

in 
CN a\ 

-rH 
O 1 

in CO 
1 a\ 

rH 


1-50, 2756-2816, 
498-1495 


1-50, 2756-2816, 
498-1495 


Sequence 
Length 


2168 


1 2168 


2168 


1 2168 


2168 


1 2168 1 


1 2168 1 


1 2168 1 


2709 


2709 


1 2709 


2709 


2709 


1 2709 


2709 


2709 


2958 


2958 


2958 


2958 


2958 


2958 


Incyte 

Polynucleotide ID 


7617689CB1 


1 7617689CB1 


7617689CB1 


1 7617689CB1 


7617689CB1 


1 7617689CB1 1 


1 7617689CB1 1 


1 7617689CB1 1 


6881669CB1 


6881669CB1 


1 6881669CB1 


6881669CB1 


6881669CB1 


1 6881669CB1 


6881669CB1 


6881669CB1 


2742486CB1 


2742486CB1 


2742486CB1 


2742486CB1 


2742486CB1 


2742486CB1 


Polynucleotide 
SEQ ID NO: 




00 
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3 ' Position 1 




1949 


H 
H 


1 1263 1 


CN 
CN 
CN 


1 1381 I 


00 

vo 

H 
CN 


1987 


vo 
H 
VO 


1179 


00 

vo 

00 


1392 1 


o 

CN 


1953 


1 2709 


2174 


vo 

H 
CN 


2958 j 


1730 j 


1245 


2161 


2949 j 


5' Position 


O 
H 
CN 


1342 


o 

CM 


o 

cn 
m 


H 


tH 


1 1545 1 


1369 


n 

00 


o 

00 

in 


m 

H 
H 


00 


H 


1328 


1836 


1735 


H 


2549 


1109 


cn 


1599 


2289 


Sequence Fragments 


5407696F6 
(BRAMNOTOl) 


1 70501086V1 


5407696F8 
(BRAMNOTOl) 


1 70505706V1 


757515H1 
(BRAITUT02) 


I70501894V1 


1 70505220V1 


1 70501380V1 1 


6290242H1 
(EPIPUNAOl) 


7186875H1 
(BMARTXR02) 


XJ 
•H 

1 

cn 
vo 
vo 

iH 
00 
00 

vo 
tn 

H 
PQ 
O 


4761061F8 
(BRAMNOTOl) 


7372128H2 
(BRAIFEE04) 


55046086H1 


6914742F6 
(PITUDIROl) 


2289873R6 
(BRAINONOl) 


4181117H1 
(SINITUT03) 


H 
VO 

cn 
o 

OJ 
00 

tf) 


2896553F6 
(KIDNTUT14) 


7117217H1 
(BRAHNOEOl) 


2419627R6 
(SCORNON02) 


2419627T6 
(SCORNON02) 


Selected 
Fragment ( s ) 


1383-1520 


1 1383-1520 


1383-1520 


1 1383-1520 


1383-1520 


1 1383-1520 


1 1383-1520 


11383-1520 1 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


1 

o 

* tH 

cn vo 

O 1 

o 
CN m 
J m 
cn 
ro 

m ro 

CN 


2539-2709, 74- 
93, 330-610 


I 

o 

* iH 

o^ vo 

O 1 

o 
CN ro 
1 ro 
cn 

ro ^ 
m ro 

CN CTk 


^ 

o 
-H 

cn vo 

O 1 

o 
ro 

1 ro 

cn 

ro ^ 
in ro 
CN o^ 


2539-2709, 74- 
93, 330-610 


2539-2709, 74- 
93, 330-610 


1-50, 2756-2816, 
498-1495 


1-50, 2756-2816, 
498-1495 


1-50, 2756-2816, 
498-1495 


1-50, 2756-2816, 
498-1495 


1-50, 2756-2816, 
498-1495 


1-50, 2756-2816, 
498-1495 


Sequence 
Length 


2168 


1 2168 


2168 


1 2168 


2168 


1 2168 


1 2168 


1 2168 1 


2709 


2709 


2709 


2709 


2709 


2709 


2709 


2709 


2958 


2958 


2958 


2958 


2958 


2958 


Incyte 

Polynucleotide ID 


7617689CB1 


1 7617689CB1 


7617689CB1 


7617689CB1 


7617689CB1 


1 7617689CB1 


1 7617689CB1 


1 7617689CB1 


6881669CB1 


tH 

« 
u 
cn 
vo 
vo 

rH 
00 
00 

vo 


6881669CB1 


6881669CB1 


6881669CB1 


6881669CB1 


6881669CB1 


6881669CB1 


2742486CB1 


2742486CB1 


2742486CB1 


2742486CB1 


2742486CB1 


2742486CB1 


1 Polynucleotide 
SEQ ID NO: 




00 


<n 
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3 ' Position | 


00 


2950 j 


vo 
ro 

CO 


2356 1 


2135 j 


1484 j 


1 2135 J 


1178 j 


00 

in 


00 
CTl 
VO 


1997 


1 


2774 1 


2494 j 


2441 1 


5 ' Position 


ro 
ro 
m 


2397 


cn 
vo 

H 


1791 


1526 


1142 


ro 

CN 


O 

m 


rH 


H 


o 

CN 
H 


H 


2091 


1844 


1790 


Sequence Fragments 


55047291H1 


5777360H1 , 
(BRAINOT20) ' 


55047293J1 ^ 


1398158F6 
(BRAITUT08) 


7685653H1 
(BRABDIK02) 


7723816H1 
(THYRDIEOl) 


Q 
O 

00 

ro 
ro 
cr» 
cr» 


7641640H1 
(SEMVTDEOl) 


7620645H1 
(HEARFEE03) 


6866221H1 
(BRAGNON02) 


1 

<N 
VO 

CO 

o 

H 

3 

O 

00 vo 
CM in 
vo 

^ H 

00 
a H 


1 

vo 

in 
ro 

CN 

o 
ro 

ro 
m 

m 00 
m CN 
o 

rH 

ay 
til tn 


70646995V1) 


71867729V1 


71869071V1 


Selected 

Fragment ( s ) 1 


1-50, 2756-2816, 
498-1495 


1-50, 2756-2816, 
498-1495 


vo 
H 
00 
CN 
1 

VO 

in 

in 

-H 
O 1 

in 00 

1 CJ> 
H 


1-50, 2756-2816, 
498-1495 


m 
ro 

H 
CN 

00 

o 

CN 

vo 

CN 
1 

H 


1-261, 2048-2135 


m 
ro 

rH 
CN 
1 

00 

o 

CN 

rH 
VO 
CN 
1 

rH 


1-261, 2048-2135 


1-261, 2048-2135 


1339-1469, 1- 
116, 1576-1607, 
893-1145, 1950- 
1997, 689-744 


1339-1469, 1- 
116, 1576-1607, 
893-1145, 1950- 
1997, 689-744 


1-259, 2047- 
2318, 2715-2774 


1-259, 2047- 
2318, 2715-2774 


1-259, 2047- 
2318, 2715-2774 


1-259, 2047- 
2318, 2715-2774 


Sequence 
Length 


2958 


2958 


2958 


2958 


2135 


2135 


1 2135 1 


2135 


2135 


1997 


1997 


2774 


2774 


2774 


2774 


Incyte 

Polynucleotide ID 


2742486CB1 


2742486CB1 


2742486CB1 


2742486CB1 


71556695CB1 


71556695CB1 


PQ 
U 

in 

Vo 

vo 
in 
m 

rH 


71556695CB1 


1 71556695CB1 


7472800CB1 


7472800CB1 


7475553CB1 


7475553CB1 


7475553CB1 


7475553CB1 


j Polynucleotide 
1 SEQ ID NO: 




o 

H 


rH 
H 


CN 
H 
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in 



ary 














u 














rQ 














•H 










































% 














•H 














4J 














(0 


















OJ 












O 


o 


H 


O 


o 


O 










t« 










H 




P 




1 


Q) 






1 




! 




M 


H 


i 










Pi 






CO 






Re 










3 

K 




w 


rn 


M-l 


M 


M-l 


PU 










H 






Q 


rH 


rH 


t— 1 


PQ 


rH 


iH 


H 


« 


0) 


PQ 


O 


PQ 


P3 




r t 


r 1 






O 




4J 






VD 


a\ 


o 


m 


Q) U 


00 


\o 


00 




o 


in 




VO 


VD 




vo 


00 


m 


>i-n 




H 




tn 




m 


O 0 


H 


00 




in 










00 




H 






H CU 




vo 
























3 










































0 














0) Q 
























































rH O 














o pq 








o 


rH 


OJ 


04 CO 




GO 




H 


H 
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0) 



o 

4-> 



ESS 

(0 H 
Q) 0) tn D) 



• I (d b> M 0) »d -H -H 
g 0) o > (d -H ^ ^ 



0) 

• a 52 

>irH (U <U tW 
^ i-M 4^ 

o M 0) *d 

^ JJ 4J -H (U (d 



m :i 

•H U iJ 
4J -H -H 

•rl M O 



to ^ 
td o^ 

(d W 4J 



. , - M rt: cd 



0) 

8^ 




C! C4 M O rH O 0 4J 0 U -iHH 
HO^rH <UrH ' 

M -H -H M to 
tfltU g (U CQ iH 



0) 

4J H 



H 4J 



0) U 

o 



tW -nl t-J O 
CO -H 

Q) (d a (d 

■U 4J (d M . _ _ 

U CO O -H O T3 <U 

(d Q) rH M-i © o 

Q) :i tn q 0) 
Ti>4J0wtdS(u _ , 

W i-l Q Pi h 4J O^TjqOH 

(d i;^a}unx:EH(i)B Qhh o) 

>iW TJcdM-l&WWCUCQ (d 

Mwa)d C(d<u:30)a)McQ • 
rd'^^0^3a>•r^r^O)af:JX^a) X. 

^4Jm 4J DjC;-HOOS'd(DtQJCj4J 
^ a3a)tdO •HWU.H.H Q)^0 to 

oog(dgfiNa)ua).H 

Or^04J03r^M ^^AJ 
W>,4-)4->CQ <UrHB 

o e a-H to ^ ^ ^ " 
M m 0) 0)^3 0) u-i 

iH (d CU iH o 

to o 

•H Oi to 

^ <d u 

O 4-) H -H 

Ha(d(doa)(d 

CO to M-l (d rH a O 

•HaJPTJbO) <u toco cm 
+)^caa)04J^io (d 
6^:Hi-iidOJMa) o) 



O aO»(D0HH to 

M'd-H(dH>(d 0 

0) a 4J ftXJ 4J^^rH 0) W 
fl-HflOU^WOEHtdUCd 
^ - M <U.-l 



to O 0) 0) 
•H o o n 

CO JJ (d CQ 1 



H 

SOi M 0) 0)0% ^ 
0) QJ (d CO M (d 

U to P H -H (d N 

ord 'dP'do---H 

tQb)ftl<D>iO^ cr»-H 
o ^ftfl-dH a (d >^ 



•H -H :3 (d H p rH XJ 



rj (d M Q 

b) Q ^ u 

d 3 <D O 

a (d (d d 

(d id-H (d o 

V^ 4-> 4J -H 
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to 

to (d 

•rj .H 



CO 044 JiU 

b CO vo 

o CO (dnd 

H O > 0) ,1^ Pi 
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I H ft to 
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^ CO 4J O 
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H 4J G) W tH ^ « _Q 00 
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S Oi 
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N UtH S 
H O S 
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^ i Si SJ <R & 2d MQjpi&M 
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4J to iH CN M-« 



o I M >i rdoa 
ux^ a)xJ gen 



O ^ O iH -H 

— ■ to 



'd 

0 0) 

> CO 
Q M rtJ 

g (d 0) 
3 H tn 

^ P-rj 

o 'd 
0) to 
p (d CO 
CO > - 
CO p 

•HMO 
4J X5 P 
_ 0) D) 

g M d 

P Q) -H 



0) (d P 

<u g ^ 

M OTJ CQ 

<1) H OJ M 

u M-j 'd (d 

P 0) 

Td Tt rH >i -H 
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(d -H 

0) 
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-d 
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M nj 4-» >i 
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_ X^ &I 3 
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S (d 
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XJ d 

44 ft 5. H 
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O 0) _XJ 
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OJ-H 
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H CO 
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rH U (d 
O PTJrH 
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E H ^'d Q) 
T) cd rH rH 

5 43 - o Cd 
3 p >, d QJ 
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1^ I: 
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What is claimed is: 

1 . An isolated polypeptide selected from tbe group consisting of: 

a) a polyp^de comprising an amino add sequence selected from the group consisting of 
5 SEQIDN0:l-6. 

b) a naturally occurring polypeptide comprising an amino add sequence at least 90% 
id^cal to an amino add sequ^e sdected from ttie groiqp consisting of SEQ ID N0:l-6» 

c) a biologicaUy active fragment of apolypeptide having an amino add sequ^ce selected 
from the ffoap consisting of SEQ ID NO:l-6, and 

10 d) an immunogenic fragment of a polypeptide having an amino add sequence sdected from 

the group consisting of SEQ ID NO: 1-6. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO: 1-6. 
15 3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide ^coding a polypeptide of claim 2. 

5. An isolated polynucleotide of daim 4 selected from the group consisting of SEQ ID 
20 NO:7-12. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of daim 3. 

25 7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of daim 6. 

9. A method for producing a polypeptide of claim 1, the method comprising: 

30 a) culturing a cell under conditions suitable for expression of the polypq)tide, wherein said 

cell is transformed with a recombinant polynucleotide, and said recombinant polynudeotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 

b) recovering the jwlypeptide so expressed. 
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10. An Isolated antibody which specificafly binds to a polypeptide of claim 1. 

11. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide conqndsing a polynucleotide sequence sdected from the group consisting 
5 ofSEQIDNO:7-12, 

b) a naturally occurring polynucleotide comprising a polynucleotide sequ^ce at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:7-12, 

c) a polynucleotide complementary to a polynucleotide of a), 

d) a polynucleotide complementary to a polynucleotide of b), and 
10 e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide conq)rising at least 60 contiguous nucleotides of a 
polynucleotide of claim 11. 

15 1 3. A method for d^ecting a target polynucleotide in a sample, said target polynucleotide 

having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample wilh a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 

20 complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization conq)lex, and, optionally, if 
present tiie amount thereof. 

14. A method of claim 13, wherem the probe comprises at least 60 contiguous nucleotides. 

25 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1 , the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

30 b) detecting the presence or absence of said amplified target polynucleotide or fragment 

thereof, and, optionally, if present, the amount thereof. 

16. A conyposition comprising a polypeptide of daim 1 and a pharmaceutically acceptable 
excipient 

35 
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17. A composition of claim 1 6, wherein the polypq)ti(le has an amino acid sequence selected 
firom the group consisting of SEQ ID NO: 1-6. 

18. A method for treating a disease or condition associated with decreased expression of 

5 functional NTT, comprising adnunisterlng to a patient in need of such treatment the composition of 
claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1 , the method comprising: 

10 a) e3q)osing a sample comprising a polypeptide of daim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identitied by a method of claim 19 and 
a phannaceutically acceptable excipient 

15 

21 . A method for treatmg a disease or condition associated with decreased e3q)ression of 
functional NTT, comprishig administ^ing to a patient in need of such treatment a composition of 
claim 20. 

20 22. A method for screening a compound for effectiveness as an antagonist of a polypq)tide 

of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

25 23. A composition comprising an antagonist compound identified by a method of claim 22 

and a phannaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
NTT, comprismg administering to a patient in need of such treatment a composition of claim 23. 

30 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the stq)s of: 

a) combining the polypeptide of claim 1 with at least one test compound vmder suitable 
conditions, and 



103 



wo 01/90148 



PCT/USOl/16283 



b) deteclingblDdingcxfllie polypeptide of clam 
coiqpouiKl that specifically binds to the polypeptide of claim 1 . 

26. A method of scre^iuig for a coiqpound ^t modulates the activity of the polypeptide of 
5 claim 1 , said method con^rising: 

a) combimng thepolypq)tide of claim 1 with at least one test coixQ>oimd under conditions 
permissive for the activity of the polypeptide of dahn 1, 

b) assesshig the activity of the polypq^de of daunl in the presence of 

c) con5)aring the activity of the polypeptide of claim 1 in the presence of th^ 

10 with the activity of the polypeptide of claim 1 in the absence of the test conq)oiind, wherein a change in 
the activity of the polypeptide of claim 1 in the pres^ice of the test con^mid is indicative of a 
compound that modulates the activity of the polypeptide of claim 1. 

27. A method for Screening a con;)Ound for effectiveness in altering e:q)ression of a target 
15 polynucleotide^ vdierdn said target polynucleotide comprises a sequaice of claim 5, the mettiod 

CQnq)rising: 

a) e3g)osmg a sanq)le conqxcising the target polynucleotide to a con^KHmd, under conditions 
suitable for the expression of the target polynucleotide^ 

b) detecting altered e3q[n-ession of the target pQlynucleotide» and 

20 c) comparing the expression of the target polynucleotide in the presence of varying flmo^iT^s of 

the compound and in the absence of the cooDopound. 

28. A method for assessing toxicity of a test compound, said method comprising: 
a) treating a biological sample containing nucleic acids with the test compound; 

25 b) hybridizing tlie nucleic acids of tiie treated biological sample with a probe comprising at 

least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
hybridization comply is formed between said probe and a target polynucleotide in the biological 
sample* said target polynudeotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragm^t thereof; 

30 c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, whardn a difference in the 
amount of hybridization comply in the treated biological sanople is indicative of toxicity of the test 
compound. 
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29. A diagnostic test for a condition or disease associated with the expression of NTT in a 
biological sample comprising the steps of: 

a) combining the biological sample with an antibody of claim 10, under conditions suitable 
5 for the antibody to bind the polypeptide and form an antibody:polypepti<le complex; and 

b) detecting the complex, wherein the presence of the complex correlates with the presence 
of the polypeptide in the biological sample. 

30. The antibody of claim 10, wherein the antibody is: 
10 a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

15 

31. A composition comprising an antibody of claim 10 and an acceptable exdpient 

32. A method of diagnosing a condition or disease associated with the expression of NTT in 
a subject, comprising administering to said subject an effective amount of the composition of claim 

20 31. 

33. A composition of claim 31, wherein the antibody is labded. 

34. A method of diagnosing a condition or disease associated with the e:q)ression of NTT in 
25 a subject, comprising administaing to said subject an effective amount of ttie conqwsition of daim 

33. 

35. A method of preparing a polyclonal antibody with tht specificity of the antibody of claim 
10 comprising: 

30 a) immunizing an animal with a polypq)tide having an amino add sequ^ce selected from 

the group consistmg of SEQ ID NO: 1-6, or an hnmunogenic fragment thereof, under conditions to 
elidt an antibody response; 

b) isolating antibodies from said anhnal; and 
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c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which binds specifically to a polypq>tide having an amino add sequence selected irom the 
group consisting of SEQ ID NO: 1-6. 

5 36. Anantibodyproducedby a method of claim 35, 

37. A composition comprising the antibody of claim 36 and a suitable carrier. 

38. A method of making a monoclonal antibody with the specificity of the antibody of claim 
10 10conq)rising: 

a) immimizing an animal with a polypeptide having an amino acid sequence selected from . 
the group consisting of SEQ ID NO: 1-6, or an immunogenic fragment thereof, under conditions to 
elicit an antibody response; 

b) isolating antibody producing cells from the animal; 

15 c) fusing the antibody producing cells with inmiortalized cdls to form monoclonal antibody- 

produciQg hybridoma cells; 

d) culturing the hybridoma cells; and 

e) isolating from the culture monoclonal antibody which binds specifically to a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-6. 

20 

39. Amonoclonalantibodyproducedby a method of claim 38. ' 

40. A conq)osition comprising the antibody of claim 39 and a suitable canrier. 

25 41. The antibody of daim 10, whofeintlie antibody is produced by screening a Fab 

repression library. 

42. Ihe antibody of daim 10, whmin the antibody is produced by saeening a recombinant 
immunoglobulin library . 

30 

43. A metiiod for detecting a polypeptide having an amino acid sequence selected from tiie 
group consisting of SEQ ID NO: 1-6 in a sample, conqnising tiie steps of: 

a) incubatmg the antibody of claim 10 with a sample und» conditions to allow specific 
binding of tiie antibody and the polypq)tide; and 
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b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide having an amino acid sequence selected from fiie group consisting of SEQ ID NO: 1-6 in 
the sample. 

44. A method of purifying a polypeptide having an anuno acid sequence selected from the 
group consisting of SEQ ID NO: 1-6 from a sample, the method comprising: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide; and 

b) separating the antibody from the sample and obtainiag the purified polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-6. 

45. Apolypeptide of claim 1, con5)rising the amino acid sequence of SEQ ID N0:1. 

46. A polypeptide of claim 1, con5)rising the amino acid sequence of SEQ ID N0:2, 

47. A polypeptide of claim 1, conq)rising tiie amino acid sequence of SEQ ID NO:3. 

48. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

49. A polypeptide of claim 1, con5>rising the amino acid sequence of SEQ ID NO:5. 

50. A polypeptide of claim 1, comprismg the amino acid sequence of SEQ ID N0:6. 

51. A polynucleotide of claim 1 1. comprising the polynucleotide sequence of SEQ ID NO:7. 

52. A polynucleotide of clahn 1 1 , compdshig the polynucleotide sequence of SEQ ID NO:8. 

53. A polynucleotide of claim 11, comprismg the polynucleotide sequence of SEQ ID NO;9. 

54. A polynucleotide of claim 1 1 , comprising the polynucleotide sequrace of SEQ ID 

NO:10. 

55. A polynucleotide of claim 1 1, coiiq)rising tiie polynucleotide sequence of SEQ ID 

N0:11. 
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56. Apdlynndeotide of cLaim 11, conqnisisg thepolymicleotide sequence of SEQ ID 

N0:12. 
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Arg 






"3 n n 


Gxy 


Fne 


Gly 






Jib 


Asn 


Asn 


Cys 






i JO 


Fne 


Thr 


Ser 






A C 

J 4b 


Pne 


Lys 


Ala 






•3 


Glu 


rrtl_ 

Thr 


lie 






T T C 

J /b 


Tin 

xxe 


xxe 


Pro 






TOO 

J y u 


Tyr 


His 


T «-^ii 

ijeu 






A nc 
4ub 


P'hia 

xTie 


Pro 


AX a 






A o n 


Asn 


Lys 


Axa 






A*? 


C:xXU 


Axa 


Met 






4bU 


Fne 


pne 


Leu 






4bb 


Xnr 


lie 


Glu 






480 


Arg 


Lys 


Glu 






495 


He 


Gly 


Leu 






510 


Met 


Phe 


Asp 
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Seir 


7X1 




T.Qti T.^iii 
Licli. fI.\J iJ&iX 










53 0 




Ash 


He 


Ala 


Val 


v-jr to 


irllc VclX J-Y'L 










545 


Asp 


Leu 


Lys 


Asp 




UcsU. cLits 










560 




■''J 


Met 


Trp 


Lys 




XXcS tSxsL rl,\J 














He 


rixct 


Ser 


Val 




rvsxi rieu «xy 










R Q n 

Zj y \J 


Ala 


irp 


He 


Glu 


Asp 


ijys ajlcl oer 










fine; 

D U 3 




Thr 


irp 


Gly Leu 


va.± 


vax ^ys vaj. 










fi9n 








Val 


Pro 


vax 


va± irXie ±±& 














Asp 


ASp 


Ser 


Ser 


tsjLy 


ASH jjeu AX a 










o ^ u 




Ax 9" 


Val 


Leu 


Lys 




irxO Va± ASZl 










665 




Leu 


He 


His Gly 


Lys 


He Pro Ser 










680 




Gly 


Lys 


Asn 


He 


Tyr 


Arg Lys Gin 










695 




Thr 


Ala 


Pro 


Asn 


Gly 


Arg Tyr Gly 










710 


He 


Met 


Pro Asp 


Met 


Pro Glu Ser 



725 
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<400> 3 










Met 


Gin 


Asp 


Cys 


Gly 


Asp Pro Glu 


1 








5 




Ser 


Gly 


Pro 


Gly 


Gly 
20 


Ser Leu Pro 


Val 


Pro 


Ala 


Glu 


Gin 


Arg Pro Pro 










35 


Trp 


Arg 


Ser 


Ser 


Pro 


Arg Thr Trp 










50 


Ala 


Gly 


Val 


His 


Ala 


Pro Gly Met 










65 


Gly 


Pro 


Arg 


Ala 


Pro 
80 


Gly Gly Asp 


Glu 


Thr 


Pro 


Gly 


Trp 


Leu Ser Arg 










95 


Ser 


Ala 


Ala 


Ser 


Val 


Asn Leu Gly 










110 


Leu 


Gly 


Pro 


Phe 


Phe 


Pro Lys Glu 










125 


Asn 


Thr 


He 


He 


Gly 
140 


Met He Phe 


Leu 


Leu 


Ala 


Ser 


Leu 


Val Phe Gly 










155 


Ala 


Lys 


Phe 


Met 


Phe 


Val Ala Gly 










170 


Thr 


He 


Leu 


Phe 


Gly 


Val Leu Asp 










185 


Phe 


He 


Ala 


Met 


Cys 
200 


Phe Leu Val 



Leu 


He 
535 


Val 


Val 


He 


Leu 


Glu 
540 


Gly 


He 
550 


Asp 


Lys 


Phe 


Met 


Glu 
555 


Ala 


Pro 
565 


Ser 


Arg 


xyx 




570 


Leu 


Met 
580 


Leu 


Leu 


Ser 


Leu 


Leu 
585 




Ser 
595 


trJm \J 


JrX \j 


f2l V 
oxy 


xyr 


Asn 
600 




nl n 




Leu 


Ser 


Tyr 


Pro 




610 








615 


Cay 
OCX 


Leu 
625 


\Ta 1 
VAX 


Val 
VaX 




ai a 

AX a 


Tl a 

xxe 
630 


V CLX 


Arg 
640 


Arg 




Asn 


Leu 


Tl Q 

xxe 
645 


Ser 


vax 
655 


xhjl 


Tyr 


Lys 


Arg 


(jxy 
660 




Gill 

670 


vaxy 


Asp 




XliX 


Ser 
675 


Glu 


Met 
685 


Pro 


Ser 


Pro 


Asn 


Phe 
690 


Ser 


Gly 
700 


Ser 


Pro 


Thr 


Leu 


Asp 
705 


He 


Gly 
715 


Tyr 


Leu 


Met 


Ala 


Asp 
720 



Asp Leu 
730 



Ser 


Trp 


Lys 


Ala 


Leu 


Gly 


Ser 




10 








15 


Ala 


Gly 


Ser 


Arg 


Gly 


Cys 


Gin 




25 










30 


Ala 


Arg 


Val 


Pro 


Val 


Ala 


Pro 




40 










45 


Leu 


Glu 


Ala 


Gly 


Ser 


Trp 


Gly 




55 










60 


Glu 


Ala 


Leu 


Gly 


Asp 


Leu 


Glu 




70 










75 


Asp 


Pro 


Ala 


Gly 


Ser 


Ala 


Gly 




85 










90 


Glu 


Gin 


Val 


Phe 


Val 


Leu 


He 




100 










105 


Ser 


Met 


Met 


Cys 


Tyr 


Ser 


He 




115 






120 


Ala 


Glu 


Lys 


Lys 


Gly 


Ala 


Ser 




130 










135 


Gly 


Cys 


Phe 


Ala 


Leu 


Phe 


Glu 




145 










150 


Asn 


Tyr 


Leu 


Val 


His 


He 


Gly 




160 










165 


Met 


Phe 


Val 


Ser 


Gly 


Gly 


Val 




175 






180 


Arg 


Val 


Pro 


Asp 


Gly 


Pro 


Val 




190 










195 


Arg 


Val 


Met 


Asp 


Ala 


Val 


Ser 




205 








210 
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Phe 


Ala 


Ala 


Ala 


Met 
215 


Phe 


Pro 


Asn 


Asn 


Val 
230 


Ser 


Gly 


Leu Gly 


Leu 










245 


Tyr 


Gin 


Ser 


Phe 


Glv 
260 


Val 


Val 


Leu 


Leu 


Met 
275 


Tyx 


Glu 


Ser 


Asp 


Pro 
290 


Leu 


Pro 


Lys 


Val 


Glv 
305 


Ser 


Cvs 


Phe Gly 


Phe 










320 


Glu 


Lys 


Phe 


Asn 


Leu 
335 


Glv 


Met 


Ala 


Leu 


Ser 
350 


Leu 


Ser 


Asp 


Lys 


Arg 
365 


Gly 


Asn 


Leu 


He 


Thr 
380 


Pro 


He 


Leu 


His 


He 
395 


Leu 


Val 


Val 


Ser 


Glv 
410 


Phe 


Pro 


Glu 


He 


Leu 
425 


Glv 


Le\i 


Ser 


Thr 


Leu 
440 


Glv 


Ser 


He Gly 


Ala 










455 


Tvr 


Glu 


Lys 


He 


Glv 
470 


Trp 


Ala 


Leu 


He 


Ser 
485 


Glu 


Tyr 


Ser Arg 


Arg 










500 


Thr 


Glu 


Glu 


Glu 


Arg 
515 
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Met Cys 


Ala 


Glu 


Ser 


1 






5 


Gly Cys 


Gly 


Asn 


Arg 








20 


Ala Thr 


Lys 


Glu 


Lys 








35 


He Leu 


Lys 


Pro 


Ser 








50 


Asp Glu 


Ala 


Glu 


Gly 








65 


Lys He 


Asp 


Phe 


Val 








80 


Gly Asn 


Val 


Trp 


Arg 








95 



Thr 


Ala 


Ser 


Ser 


Ser 










220 


Ala 


Thr 


Val 


Leu 


Glv 










235 


He 


Leu 


Glv 


Pro 


Pro 










250 


Tvr 


Glu 


Val 


Pro 


Phe 










265 


Val 


Pro 


Leu 


Asn 


Met 










280 


Glv 


Glu 


His 


Ser 


Phe 










295 


Leu 


He 


Ala 


Phe 


Val 










310 


Leu 


Asp 


Pro 


Thr 


Leu 










325 


Pro 


Ala 


Glv 


Tvr 


Val 










340 




Ala 


He 


Ser 


Ser 










355 


Pro 


Pro 


Leu 


Arg 


xjysi 










370 


Ala 


Glv 


cys 




Met 










385 


Lys 


Ser 


Gin 


Leu 


'Vim 










400 


Leu 


Ser 


Ala 


Gly 


Met 










415 


Ser 


Cys 


Ala 


His 


Glu 










430 


Gly 


Leu 


Val 














445 


Phe 


Met 


Gly 














4 O V 


Phe 


Glu 


Trp 


Ala 


Ala 










475 


Gly 


Leu 


Ala 


Met 


Gly 










490 


Lys 


Arg 


Ser 


Lys 


•99X 










505 


Thr 


Thr 


Leu 


Leu 


Pro 










520 


71556695CD1 






Ala 


Asp 


Gly 


Ala 


Ala 










10 


Pro 


Pro 


Ser 


Thr 


Gin 










25 


Leu 


Gin 


Cys 


Leu 


Lys 










40 


Pro 


Gly 


Lys 


Ser 


Pro 










55 


Lys 


Pro 


Pro 


Gin 


Arg 










70 


Leu 


Ser 


Val 


Ala 


Gly 










85 


Phe 


Pro 


Tyr 


Leu 


Cys 










100 
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He 


Leu 


Ala 


Lys 


Ala 










225 


Ser 


Leu 


Glu 


Thr 


Phe 










240 


Val 


Glv 


Glv 


Phe 


Leu 










255 


He 


Val 


Leu 


Glv 


Cys 










270 


Tvr 


He 


Leu 


Pro 


Asn 










285 


Trt) 


Lys 


Leu 


He 


Ala 










300 


He 


Asn 


Ser 


Leu 


Ser 










315 


Ser 


Leu 


Phe 


Val 


Leu 










330 


Glv 


Leu 


Val 


Phe 


Leu 












Pro 


Leu 


Phe 


vaXjf 


XJISU 












xxp 


Leu 




VAX 


irLlts 












Leu 


Leu 


Gly 


JrXU 


VCiX 










•J ^ \J 


Leu 


Leu 


Val 


XJQLL 


Tl 0 

XX cs 










405 


Ser 


He 


He 




XXIX 










420 


Asn 


Gly 


Phe 


m 11 
V9XLL 


V3XU 












Leu 


JrUxs 


OCX 


AT a 

AXCL 














xjcix 


wxy 




IrXie 


Leu 










465 


Ala 


He 


Gin 


Gly 


Leu 










480 


Leu 


Phe 


Tyr 


Leu 


Leu 










495 


Gin 


Asn 


He 


Leu 


Ser 










510 

^ X V 


Asn 


Glu 


Thr 






Leu 


Glv 


Pro 


Ala 


Ser 










15 


Glu 


Ser 


Lys 


Glu 


Met 










30 


Asp 


Phe 


His 


Lys 


Asp 










45 


Gly 


Thr 


Arg 


Pro 


Glu 










60 


Glu 


Lys 


Trp 


Ser 


Ser 










75 


Gly 


Phe 


Val 


Gly 


Leu 










90 


Tyr 


Lys 


Asn 


Gly 


Gly 



105 
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Gly Ala Phe 
Leu Pro Val 
Glu Gly Gly 
Gly He Gly 
Tyr Tyr He 
Ser Phe Gin 
Asn Thr Pro 
Val Trp He 
Glu Phe Trp 
His Pro Gly 
Val Trp Leu 
.Thr Gly Lys 
Leu Leu Val 
Ala Gly He 
Asp Pro Gin 
Tyr Ala He 
Lys Tyr Lys 
Leu Asn Ser 
He Leu Gly 
Val Ala Glu 
Ala Val Thr 
Phe He Met 
Val Glu Gly 
Leu Arg Lys 
Ser He Ser 
Met Tyr Val 
Cys Leu Leu 
He Tyr Gly 
Gly Tyr Arg 
Thr Pro Val 
Tyr Val Pro 
Ala He Gly 
Val Pro Leu 
Phe Leu Val 



T.011 


He 
110 




Phe 


Phe 
125 


Leu 


He 


Thr 
140 


Cys 


Tvrr 


Ala 

155 


Ser 


Val 


He 
170 


Leu 


Lys 


Glu 
185 


Leu 


His 


to 

200 


Met 


Thr 


He 
215 


Ser 


Glu 


Arg 
230 


Asn 


Ser 


Leu 
245 


Lys 


Val 


Cys 
260 


Phe 


Val 


Val 
275 


xyi. 


Leu 


Leu 
290 


VCLX 


jjy o 


JrJUtS 

305 




yfciX 


Trp 
320 


X j.e 


Cys 


Leu 
335 


VaJLy 




Rem 

350 


OCX 




365 


oex 




380 


AT a 

Axa 




395 


Pro 




Met 
410 


Jr X V 




425 


Leu 


Gin 


He 
440 


Thr 


Glv 


Tvr 
455 


**X \J 


jf 


Leu 
470 


Leu 


Phe 


Gin 
485 


Leu 




Val 
500 


Ala 


Glv 


Asp 
515 


Asn 


Pro 


Gly 
530 


Pro 


Leu 


Cys 
545 


Val 


Leu 


Thr 


Tyr 




560 


Leu 


Gly 
575 


Trp 


Val 


He 
590 


Val 


Arg 


Val 


Lys 



Tyr 


Phe 


He 


Glu 


He 


He 


Trp 


Glu 


Lys 


Val 


Val 


He 


Ala 


Trp 


Ala 


Pro 


Trp 


Ala 


Glu 


Asp 


Thr 


Ser 


Thr 


Asn 


Val 


Leu 


Ser 


Trp 


Asp 


Leu 


Phe 


Cys 


He 


Phe 


Thr 


Ala 


Arg 


Gly 


Leu 


Leu 


Tyr 


Pro 


Asp 


Ala 


Gly 


Ala 


Met 


Thr 


Tyr 


Arg 


Asp 


Phe 


Val 


Ser 


Gin 


Glu 


Gin 


Gly 


Leu 


Ala 


Leu 


Pro 


Thr 


Leu 


Gly 


Leu 


Ser 


Leu 


Val 


Arg 


Glu 


He 


Gly 


Leu 


Thr 


Phe 


Asp 


Tyr 


Phe 


Phe 


Glu 


Leu 


Tyr 


Asp 


Trp 


Met 


Lys 


Gly 


Cys 


Phe 


Asn 


Lys 


Thr 


Ser 


Leu 


Ala 


He 


Arg 


Leu 


Tyr 


Leu 


Leu 



Phe 


Leu 


Phe 


115 






He 


Glv 


Gin 


130 






He 


Cys 


Pro 


145 






Val 


Ser 


Leu 


160 






Thr 


Tvr 

xjr X 


Tvr 

xy J. 


175 






His 


Cys 


Asn 


190 






Met 


Arg 


Lys 


205 






Phe 


Thr 


Ser 


220 






Leu 


Ser 


Pro 


235 






Ala 


Leu 


Cvs 


250 






xxp 


Lys 


Gly 


265 






Thr 


Phe 


Pro 


280 






Thr 


JJCStX 


Jr X \J 


295 






Asp 


T1 c* 


XIlx 


X V 






Xllx 


Gl Tl 
VaXli 


Tl o 
XX€ 


325 






0«5X 


- 

jjeu 


^wxy 












T. All 


355 






VvXjf 


JrXics 


Ala 
AXa 


370 




* 


oxy 


VCIX 


Asp 


385 






irxie 


Tl a 
xxe 


ai a 

nxa 


Ann 








xrp 


ser 


4 X ^ 






Asp 


Otsx 


Gl n 


430 






Asp 


Leu 


xyx 


445 






Phe 


He 


Ala 


460 






Met 


Val 


Thr 


475 






Tvi" 
xyx 


Ala 


Ala 


490 






Cys 


phe 


Val 


505 






Glv 


He 


Glu 


520 






xyx 


Ser 


xxp 


535 






He 


Phe 


Ser 


550 






Tyr 


Val 


Tyr 


565 






Leu 


ser 


Ser 


580 






Cys 


Gin 


Thr. 


595 




Thr 


Pro 


Arg 



Glv 


Ser 


Glv 
120 


Tvr 


Thr 


Ser 
135 


Leu 


Phe 


Ser 
150 


Leu 


Asn 


Val 
165 


Leu 


Phe 


Gin 
180 


His 


Ser 


xxp 
195 


Asn 


Lys 


Ser 
210 


Pro 


Val 


He 
225 


Gi V 


He 


Asp 
240 


Leu 


Leu 


Leu 
255 


Val 


AX y 


Ser 
270 


Phe 


Ala 


Met 
285 


Gly 


Ala 


Glv 
wxy 

300 


AX 


Ti£l11 
XJcLl 


Gl n 

315 






OCX 

330 


•9CX 


xyx 


A9XX 

345 


Leu 


Glv 
oxy 


360 


Tie 

X XC5 


Pha 


iScx 

375 


Tie 

X X IS 


nxci 


Ak3^ 

390 


xyx 


JCrX SI 


uy 0 

405 


Tl c» 

XX 


JJCU 


PVio 

IrXicS 

420 


Phe 


Val 


Glu 
435 


Pro 


Ser 


Phe 
450 


Phe 


Val 


Cvs 
465 


Glu 


Glv 
\jxy 


Glv 
\axy 

480 


Ser 


Glv 


Val 
495 


He 


Ala 


Tim 
xxp 

510 


Asp 


Met 


He 
525 


Ala 


Val 


He 
540 


Leu 


Val 


xjy o 

555 


Pro 


Asn 


Trp 
570 


Met 


Leu 


cys 
585 


Glu 


Gly 


Pro 
600 


Glu 


Pro 


Asn 
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605 

Arg Trp Ala Val Glu Arg Glu Gly 
620 

Thr Val Met Asn Gly Ala Leu Val 
635 

Glu Thr Met Met 
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Met 


Ser 


Lys 


Ser 


Lys 


Cys 


Ser 


Val 


1 








5 








Ala 


Pro 


Ala 


Lys 


Glu 


Pro 


Asn 


Ala 








20 








Leu 


lie 


Leu 


Val 


Lys 


Glu 


Gin 


Asn 










35 








Thr 


Leu 


Thr 


Asn 


Pro 


Arg 


Gin 


Ser 










50 






Glu 


Thr 


Trp 


Gly 


Lys 


Lys 


He 


Asp 










65 








Phe 


Ala 


Val 


Asp 


Leu 


Ala 


Asn 


Val 










80 








lyr 


Ujr 9 


2i on 

AO IX 








Ala 


irne 










95 








Met 


Val 


He 


Ala 


Gly 


Met 


Pro 


Leu 










110 








Gly 


Gin 


Phe 


Asn 


Arg 


Glu 


Gly 


Ala 










125 








Pro 


He 


Leu 


Lys 


Gly 


Val 


Gly 


Phe 










140 








Tyr 


Val 


Gly 


Phe 


Phe 


Tyr 


Asn 


Val 










155 








Tyx 


Leu 


Phe 


Ser 


Ser 


Phe 


Thr 


Thr 










170 








Asn 


Asn 


Ser 


Trp 


Asn 


Ser 


Pro 


Asn 










185 








Asp 


Ser 


Ser 


Gly 


Asp 


Ser 


Ser 


Gly 










200 








Thr 


Pro 


Ala 


Ala 


Glu 


Tyr 


Phe 


Glu 










215 






Gin 


Ser 


His 


Gly 


He 


Asp 


Asp 


Leu 










230 








Thr 


Ala 


Cys 


Leu 


Val 


Leu 


Val 


He 










245 








Trp 


Lys 


Gly 


Val 


Lys 


Thr 


Ser 


Gly 










260 








Thr 


Met 


Pro 


Tyr 


Val 


Val 


Leu 


Thr 










275 








Thr 


Leu 


Pro 


Gly 


Ala 


He 


Asp 


Gly 










290 








Asp 


Phe 


Tyr 


Arg 


Leu 


Cys 


Glu 


Ala 










305 








Thr 


Gin 


Val 


Cys 


Phe 


Ser 


Leu 


Gly 



320 



610 615 
Ala Thr Pro Tyr Asn Ser Arg 

625 630 
Lys Pro Thr His He He Val 

640 645 



Gly 


Leu 


Met 


Ser Ser 


Val 


Val 




10 








15 


Val 


Gly 


Pro 


Lys Glu 


Val 


Glu 




25 








30 


Gly 


Val 


Gin 


Leu Thr 


Ser 


Ser 




40 








45 


Pro 


Val 


Glu 


Ala Gin 


Asp 


Arg 




55 






60 


Phe 


Leu 


Leu 


Ser Val 


He 


Gly 




70 








75 


Trp 


Arg 


Phe 


Pro Tyr 


Leu 


Cys 




85 








90 


Leu 


Val 


Pro 


Tyr Leu 


Leu 


Phe 




100 








105 


Phe 


Tyr 


Met 


Glu Leu 


Ala 


Leu 




115 








120 


Ala 


Gly 


Val 


Trp Lys 


He 


Cys 




130 








135 


Thr 


Val 


He 


Leu He 


Ser 


Leu 




145 








150 


He 


He 


Ala 


Trp Ala 


Leu 


His 




160 








165 


Glu 


Leu 


Pro 


Trp He 


His 


Cys 




175 








180 


Cys 


Ser 


Asp 


Ala His 


Pro 


Gly 




190 








195 


Leu 


Asn 


Asp 


Thr Phe 


Gly 


Thr 




205 








210 


Arg 


Gly 


Val 


Leu His 


Leu 


His 




220 








225 


Gly 


Pro 


Pro 


Arg Trp 


Gin 


Leu 




235 








240 


Val 


Leu 


Leu 


Tyr Phe 


Ser 


Leu 




250 








255 


Lys 


Val 


Val 


Trp He 


Thr 


Ala 




265 








270 


Ala 


Leu 


Leu 


Leu Arg 


Gly 


Val 




280 








285 


He 


Arg 


Ala 


Tyr Leu 


Ser 


Val 




295 








300 


Ser 


Val 


Trp 


He Asp 


Ala 


Ala 




310 








315 


Val 


Gly 


Phe 


Gly Val 


Leu 


He 




325 








330 
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Ala 


XrLXvs 


Coy* Cot* 




As XI 


Lys 


irXie 








335 










VcLX XxiX 


Tlir 


Ser 


Tl f> 
X Xc 


Asn 
















Gly 


Phe 


Va 1 \7a ^ 




ser 


roller 


Leu 








365 








Ser 


Val 


*r X w X Xc 


Gl V 




Val 
VcLX 


■cixa 








380 








Leu 


His 


xjcu \j xy 


Leu 


ucix 


A TTCT 
AX ^ 


\Ta 1 
VclX 








J ^ -> 








Gin 


Tiir 


GesT Ala 


xxp 


Ala 

AXCl 


Val 

VCLX 


\7"a 1 
VcLX 








410 








Leu 




XXc: 


9@X 


Al a 

Axa 


Mai- 
ne u 


variy 














Gly 


Leu 


T 1 o Hen 
xxts A£>J^ 


Gl n 




WsXIi 


Leu 






AAQ 












JJ9U. c'xie 


X j.e 


Vax 


Leu 


Aia 




















Xxix ziSZl 


«iy 




Xxe 










<fc / u 








A1 a 
A J. a 


2V 1 a fill w 


Tlir 


Leu 


Aiet 


ASp 


















vcix v»±y 




irne 


Ser 


ASp 














Gin 


AX ^ 


pT*rt Ocv- 


Leu 


Tyr 


Trp 


Arg 








3X^ 








Pt"o 
Jr i U 




Xrllc JjcU. 


Leu 


irXie 


vax 


vax 








3 J u 












Pro Hxs 


Tyr 


ijxy 


Axa 


Tyr 














211 a 


_ 

jjeu 


vaiy ixp 


vax 


X le 


Ala 


Tnr 








560 








He 


Tyr 


Ala Ala 


Tyr 


Lys 


Phe 


Cys 








575 






Glu 


Lys 


Leu Ala 


Tyr 


Ala 


He 


Ala 








590 








Val 


Asp 


Arg Gly 


Glu 


Val 


Arg 


Gin 








605 








Leu 


Leu 


Gly Leu 


Ser 


Arg 


Gly 


Asn 



620 

<210> 6 

<211> 592 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 7475553CD1 

<400> 6 



Met 


Glu 


Lys 


Ala 


Arg 


Pro 


Leu 


Trp 


1 








5 






Phe 


Ala 


Cys 


He 


Ser 
20 


Tyr 


Ala 


Val 


Phe 


Pro 


Tyr 


Leu 


Cys 
35 


Gin 


Met 


Tyr 


Pro 


Tyr 


He 


He 


Met 
50 


Leu 


He 


Val 


Leu 


Glu 


Leu 


Ala 


Val 


Gly 


Gin 


Arg 










65 






Ala 


Trp 


Arg 


Thr 


He 


Ser 


Pro 


Tyr 










80 






Ser 


Val 


Val 


Val 


Ser 
95 


Phe 


Phe 


Leu 


Asn 


Ala 


Trp 


Ala 


Phe 


Trp 


Tyr 


Leu 



110 



Thr 


Asn 


Asn 


cys xyx Axy 


Asp 




340 








Ser 


Leu 


xnr 


oer file £>er 


Ser 




355 






J oU 


Gly Tyr 


nei. 


Til^ Oivs T^/^c^ 

Axa trxn Liys 


flxS 




370 






"3 T C 


Lys 


Asp 


v»xy 


vsXU r'ro jjeu 


Leu 




385 






7 Q A 

J y u 


Ala 


Pro 


Ala 

Axa 


Oav Oav* 

jrro oex oer 


vvxn 




400 






4U3 


Phe 


Phe 


Tl o 

xxe 


Itf^^^ T*^*ii T-^iii 

net xieu xieu 


Thr 




415 








Gly Met 


mil 


s>eT vai xie 


Thr 




430 






4 J 3 


Leu 


His 


Arg 


nlS Arg GxU 


Leu 




445 






Thr 


Phe 


Leu 


Leu Ser Leu 


Pne 




460 






403 


Val 


Phe 


Thr 


Leu Leu Asp 


His 




475 




480 


Thr Asp 


Aia 


Thr Ser Ser 


Pro 




490 






yi Q c 


Asp 


He 


Vain 


v»in Mec Tnr 


Gly 




505 






510 


Leu Cys 


Trp 


jjys Leu val 


Ser 




520 






3Z3 


Val 


Val 


•ber 


xie val Tnr 


Phe 




535 






340 


He 


Phe 


Pro 


Asp Trp Ala 


T^n 




550 






555 


Xaa 


Ser 


Met 


Ala Met Val 


Pro 




565 






570 


Ser 


Leu 


Pro 


Gly Ser Phe 


Arg 




580 






585 


Pro 


Glu 


Lys 


Asp Arg Glu 


Leu 




595 






600 


Phe 


Thr 


Val 


Arg Ser Arg 


Ser 




610 






615 


Ser 


Glu 










625 









Ala Asn 


Ser 


Leu Gin Phe 


Val 


10 






15 


Gly Leu 


Gly 


Asn Val Trp 


Arg 


25 






30 


Gly Gly 


Gly 


Ser Phe Leu 


Val 


40 






45 


Glu Gly 


Met 


Pro Leu Leu 


Tyr 


55 






60 


Met Arg 


Gin 


Gly Ser He 


Gly 


70 






75 


Leu Ser 


Gly 


Val Gly Val 


Ala 


85 






90 


Ser Met 


Tyr 


Tyr Asn Val 


He 


100 






105 


Phe His 


Ser 


Phe Gin Asp 


Pro 


115 




120 
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Leu Pro Trp 
Asp Glu Glu 
Arg hys Thr 
Val Gin Trp 
Val Tyr Leu 
Val Tyr Phe 
Leu lie Arg 
Tyr Met Phe 
Trp He Asn 
Phe Gly Ser 
Asn cys Gin 
Thr Ser He 
Lys Ala Thr 
Leu Leu Thr 
Ser Asn Leu 
Ser Lys Tyr 
Glu Ser Glu 
He Val Tyr 
Trp Ser Val 
Ser Met Leu 
Ser Lys He 
Leu Val Cys 
Glu Ala Gly 
Thr Leu Ser 
Cys Tyr Val 
Met Thr Gly 
Gly Val Ser 
Ser Asp Tyr 
Ala Ser Gin 
Leu Ala Val 
Pro Leu Ala 
Gly Asp Ala 

<210> 7 
<211> 2168 



Ser 


Val 


v^y o 




125 






Glu 


uy o 




140 




Leu 


Asn 


He 




155 




Glu 


Pro 


Ala 




170 




V- jr o 


X XC: 






X03 




JLiXX 


AT ;q 
AX a. 


Ser 




^ u u 






Leu 


TJtir 




^ X D 




nTU -y. 
XXIX 


Pro 


Lys 








Ala 


Ala 


Xllx 




245 






xx^ 


Ala 
AX a 




^ ou 




Lys 


nxo 


Axa 




^ f D 




Irue 


AXa 


Ser 










Asn 


Tyr 








Asn 


Thr 


jpne 








nil 11 




vax 




J 3 






vXU. 






J ^ V 




Leu 


Ao^ 


XJLiX 








xux 


v?xu. 


aXA 








- 

Leu 


Tyr 


irne 










Asn 


Tnr 




4XU 




X JLe 


Ser 


Ser 








_ 

Leu 


VaX 


Asn 










xyr 


Trp 








Leu 


XJ^U 


Leu 




*± / \j 






v9xy 


Leu 




485 




AX ^ 


Ala 


VCIX 




500 




XT X U 


XiC=LL 


- 

Leu 




515 




He 




XXIX 




530 




Gly 


Gin 

varX4i 


TiOii 
XietL 




545 




He 


Gly 


Leu 




560 




Ala 


Leu 


Gly 




575 


Asp 


Pro 


Val 




590 





Pro 


Leu 


Asn 


Ala 


Ser 


Ser 


Ser 


Pro 


Ser 


Leu 


Cys 


Leu 


Arg 


Gly 


Thr 


Leu 


Pro 


Tyr 


Leu 


His 


Gly 


He 


Glu 


Gin 


Gin 


He 


Phe 


Phe 


Ala 


Ser 


He 


He 


Val 


He 


Val 


Thr 


Glu 


Asn 


Cys 


Asp 


Leu 


Glu 


Lys 


Gly 


TVr 


Phe 


Pro 


Gin 


Ala 


Val 


Gin 


He 


Lys 


Asn 


Phe 


Met 


Leu 


Ala 


Ala 


He 


His 


Leu 


Pro 


cys 


Ala 


He 


Phe 


Asp 


He 


He 


Val 


Leu 


Arg 


Arg 


Phe 


Ser 


Trp 


Tyr 


He 


Val 


Ser 


Gly 


Thr 


Leu 


Val 


Thr 


Lys 


Leu 


Val 


Ala 


Thr 


Phe 


Val 


Ala 







v^xy 


Asn 


nxo 


X J \J 






Thr 


Gin 


xyx 


145 






Leu 


Gin 


Gil] 


160 






Leu 


Leu 


AXCL 


175 






V3X LL 


ser 


Xllx 


X 17 U 






Cys 


Va.X 


Leu 


^ uo 






aXo. 


xnr 


Asn 








Leu 


H 1 a 


Asn 


Z J D 








0«5X 


Leu 


250 






x-yr 


AoXJ. 


Pill 
^xu 


265 






dcsx 


- 

Leu 


xxe 








irne 


Ser 


xxe 








Leu 


Lys 


Lys 


11 n 
J xu 






Asp 


*jxy 


jpne 


IOC 






Leu 


Axa 


Ser 


1 AH 






xxe 


Lys 


Asn 


ICC 






IjXy 


XHx 


laxy 


i7n 






ciei. 


(.3XU 


vax 


IftC 

O OS 






Leu 


jxiex. 


Leu 


Ann 

4 u u 






Leu 


Thr 


Pro 


4Xo 






Lys 


6XU 


Axa 


Ai n 

^fc J u 






v»xy 


ixieu 


vax 








xrne 


Asn 


Asp 


A^n 

4 o u 




vdx 


m 11 

VsXiX 


xnr 


475 






V9XU 


Cay 
•3C5X 


Asp 


490 






xrp 


- 

Lys 


T7a 1 
VaX 








Leu 




VaX 


520 






Ltys 


Tyr 


r*1 n 
vaXn 


535 






Asp 


Tyr 


Pro 


550 






Ser 


Ser 


Thr 


565 






Gin 


Arg 


Arg 


580 







Thr 


Gly 


xy^: 






135 


Phe 




xy X 






150 


Asn 


Gly 


Glv 
oxy 






165 


xxp 


Leu 


Val 






180 


vjxy 


juy & 


Val 

VCLX 






195 


XXcs 


Tl c» 

XXts 


xyr 






01 n 
zxu 




Leu 








ooc 


JCrX W 


xjy B 


Al a 






240 


v»xy 


TiOii 
XJCU. 


v^xy 








It X V 


ser 


Asn 






o7n 


Asn 


Ser 


Jrue 






OftC 


xyr 


fil vr 
laxy 


irne 






1 nn 


vax 


Ser 


Leu 






1 1 c 
JX3 


Leu 


Xllx 


1 9 

Axa 






11 rt 
J J u 


a 

Axa 


Tyr 


Pro 






lA^ 


v^y o 


ocsx 


Leu 






1 fin 


_ 

Lieu 


ai a 
Axa 


Jrns 






17C 


ser 


1.9X11 


Leu 






ion 


v»xy 


Tl £1 

xxe 


vaxy 






A nc 


Leu 


Tnr 


Asp 






/on 


xxe 


Ser 


VarXy 






Al C 
4 J 3 


irne 


Tnr 


jxieu 






Acn 


Tyr 


an a 

aXo. 


Ala 

Axa 








XXfes 


Al a 

AXA 


Val 
VclX 






480 


TtAll 
JJtSlX 




Al a 

AXd 








Met 


xtp 


Al a 

AXd 






m n 

«J X u 


Phe 


xyx 


TtOii 






525 


Ala 


Tim 
xxp 


AS ^ 






540 


Ala 


Tyr 


Ala 






555 


Met 


Cys 


He 






570 


Leu 


Lys 


Arg 






585 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7617689CB1 

<400> 7 

ccacgcgtcc gtaaggtggg atggatagca gggtctcagg cacaaccagt aatggagaga 60 

caeiaaccagt gtatccagtc atggaaaaga aggaggaaga tggcaccctg gagcgggggc 120 

actggaacaa caagatggag tttgtgctgt cagtggctgg ggagatcatt ggcttaggca 180 

acgtctggag gtttccctat ctctgctaca aaaatggggg aggtgccttc ttcatcccct 240 

acctcgtctt cctctttacc tgtggcattc ctgtcttcct tctggagaca gcactaggcc 300 

agtacactag ccagggaggc gtcacagcct ggaggaagat ctgccccatc tttgagggca 360 

ttggctatgc ctcccagatg atcgtcatcc tcctcaacgt ctactacatc attgtgttgg 420 

cctgggccct gttctacctc ttcagcagct tcaccatcga cctgccctgg ggcggctgct 480 

accatgagtg gaacacagaa cactgtatgg agttccagaa gaccaacggc tccctgaatg 540 

gtacctctga gaatgccacc tctcctgtca tcgagttctg ggagcggcgg gtcttgaaga 600 

tctctgatgg gatccagcac ctgggggccc tgcgctggga gctggctctg tgcctcctgc 660 

tggcctgggt catctgctac ttctgcatct ggaagggggt gaagtccaca ggcaaggtgg 720 

tgtacttcac ggccacattt ccttacctca tgctggtggt cctgttaatt cgaggggtga 780 

cgttgcctgg ggcagcccaa ggaattcagt tttacctgta cccaaacctc acgcgtctgt 840 

gggatcccca ggtgtggatg gatgcaggca cccagatatt cttctccttc gccatctgtc 900 

ttgggtgcct gacagccctg ggcagctaca acaagtacca caacaactgc tacagggact 960 

gcatcgccct ctgcttcctc aacagcggca ccagctttgt ggccggcttt gccatcttct 1020 

ccatcctggg cttcatgtct caggagcagg gggtgcccat ttctgaggtg gccgagtcag 1080 

gccctggcct ggctttcatq gcttacccgc gggctgtggt gatgctgccc ttctctcctc 1140 

tctgggcctg ctgtttcttc ttcatggtcg ttctcctggg actggatagc cagtttgtgt 1200 

gtgtagaaag cctggtgaca gcgctggtgg acatgtaccc tcacgtgttc cgcaagaaga 1260 

accggaggga agtcctcatc cttggagtat ctgtcgtctc cttccttgtg gggctgatca 1320 

tgctcacaga gggcggaatg tacgtgttcc agctctttga ctactatgca gccagtggca 1380 

tgtgcctcct gttcgtggcc atcttcgagt ccctctgtgt ggcttgggtt tacggagcca 1440 

agcgcttcta cgacaacatc gaagacatga ttgggtacag gccatggcct cttatcaaat 1500 

actgttggct cttcctcaca ccagctgtgt gcacagccac ctttctcttc tccctgataa 1560 

agtacactcc gctgacctac aacaagaagt acacgtaccc gtggtggggc gatgccctgg 1620 

gctggctcct ggctctgtcc tccatggtct gcattcctgc ctggagcctc tacagactcg 1680 

gaaccctcaa gggccccttc agagagagaa tccgtcagct catgtgccca gccgaggacc 1740 

tgccccagcg gaacccagca ggaccctcgg ctcccgccac ccccaggacc tcactgctca 1800 

gactcacaga gctagagtct cactgctagg gggcaggccc ttggatggtg cctgtgtgcc 1860 

tggccttggg gatggctgtg gagggaacgt ggcagaagca gccccatgtg cttccctgcc 1920 

cccgacctgg agtggataag acaagagggg tattttggag tccacctgct gagctggagg 1980 

cctcccactg caacttttca gctcaggggt tgttgaacag atgtgaaagg ccagtgccaa 2040 

gagtgtccct ctgagaccct tgggaagctg ggtgggggct ggtaggtggg gcgagacttg 2100 

ctggcttcgg gccctctcat ccttcattcc attaaatcca cattcttccc gctgaaaaaa 2160 

aaaaaaaa 2168 

<210> 8 
<211> 2709 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 68816 69CB1 

<400> 8 

catcctcggt ctttacttcc tatcaagtaa cattgttttg aaaaatagag ttaacacatt 
tgccataagg gagttttttt ttttttttta aattacttcg aatactctcc aaatgcccaa 
aaatagcaag gtggtaaaaa gagaattaga tgatgatgtt actgagtctg tcaaagacct 
tctttccaat gaagacgcag ctgatgatgc ttttaagaca agtgaactaa ttgttgatgg 
ccaggaagag aaagatacag atgttgaaga aggatctgaa gtcgaagatg aaagaccagc 
ttggaacagt aaactacaat acatcctggc ccaagttgga ttttctgtag gtttaggaaa 
tgtgtggcga tttccatacc tatgtcagaa gaatgggggc ggtgcatatc ttttaccata 
tttaatacta cttatggtaa taggtattcc cctttttttc ttggaactct ctgtgggtca 
aagaattcgg cgaggcagca ttggtgtatg gaattacata agccctaaac tgggcgggat 
tggatt.tgca agttgtgtag tgtgctattt tgtagctctc tactacaacg tcatcattgg 
ctggagtttg ttttattttt ctcagtcttt tcagcaaccc ctgccttggg atcagtgtcc 
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tttggtgaaa aatgcttcac acacttttgt 
cacctattac tggtacaggg aagcactgaa 
cttaaactgg aagatgacca tctgcttgtt 
gatcaaaggc attcagtctt ctggaaaaat 
ggtacttatt tgcttcctca tcagagcatt 
ccacafcgttt acccctaagc ttgaaataat 
tactcaagtg ttctttgcct taggtctggg 
caacaagaga gacaacaact gccactttga 
cacttctgtc ctggcaacat tggtggtgtt 
aaatgagaaa tgcattacac aaaattcaga 
cattagtcag gatattattc cccatcatat 
tcatttagtt tatgacatca ttcaaaaagt 
caattcctgt aaaattgaag aagagctaaa 
tattgccttt acagaagcga tgacacattt 
tttcctcatg ctggtcaatc taggccttgg 
cacgcctatt gtggacactt tcaaagtgag 
tctggcattt tgtattggcc tgatatttgt 
gtttgatgat tattctgcta cactgcctct 
tgtatgcttt gtttatggca tagataagtt 
tgctcccagc agatattact actatatgtg 
attgctaata gctagtgttg tgaatatggg 
tgaagataag gcatctgaag aatttctgag 
ctctctggtt gtctttgcaa tactcccagt 
ccttatagat gatagttctg gtaatttagc 
gaaagagcct gtgaacttag agggcgatga 
cgagatgcca tctccaaatt ttggtaaaaa 
tctggatact gctcccaatg gacggtatgg 
agatatgcca gaatctgatt tgtagctggg 
ttttaccaat gaacattggc cctagtagga 
atctcaggtg ttccgtggct gtgatcttta 
gcattctttt ggattctttg gtttacattt 
agggctgagg tacatgtttg ccaggatttt 
atttctatct cttaaaaaaa tggtattacc 
tgttgacaat taaaaatggt atacattaaa 
gtgtcacac 

<210> 9 

<211> 2958 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2742486CB1 

<400> 9 

caggtccaag agattcttct atctcagcct 
tggaccaacg ctgggtggat ttctgtatga 
acaaggtcta tgggctctga taagttgatt 
ctccagtaga taaaacaagc ccatggctaa 
agctgctgcc cattcaaaac caattttctc 
aaaagcacca attgaccaca ttgcactaaa 
tccctcttca aacccatttt catgtgcaca 
actcattcca gcagagaggc cacttacaac 
tttaatatgc aagattgcgg ggacccagag 
ggcggatcgc tgcctgcagg gagtcgggga 
gccagagtgc cagtggctcc ttggaggtcg 
tggggcgccg gagtccacgc accggggatg 
gcaccaggag gtgatgatcc tgcaggaagt 
gaacaggttt ttgtactgat atcggcagct 
tctatacttg gaccgttttt ccccaaagag 
atcggtatga tctttggatg ttttgctttg 
aactatctgg tacatattgg agcaaaattt 
ggagttacaa ttctctttgg tgtattggac 
atgtgttttc tagtgagagt aatggatgca 
tcttctatcc tggcaaaggc ttttccaaat 
actttttctg gactggggct aatactaggt 



agaaccagaa tgtgaacaaa gttctgccac 720 
tatttcaagt tccatttctg aaagtggggg 780 
ggctgcctgg gtcatggttt gcttggctat 840 
catatatttt agttctctgt ttccatatgt 900 
ccttttaaat ggttcaattg atggcattcg 960 
gctggagccc aaggtctgga gagaagctgc 1020 
atttggtggt gtcattgcct tttcaagcta 1080 
tgctgtcctg gtgtccttca tcaatttttt 1140 
tgcagttctg ggcttcaaag caaatgtcat 1200 
gacgatcatg aaatttttga aaatggggaa 1260 
caacctttca actgttactg cagaagatta 1320 
gaaagaagaa gagtttcctg ctcttcatct 1380 
taaagctgtt caggggaccg gcttagcttt 1440 
tcctgcatct cccttctggt cagtgatgtt 1500 
cagtatgttt ggaaccattg aagggattgt 1560 
gaaagaaatt cttactgtta tctgttgtct 1620 
gcaacgctct ggaaattact ttgttacaat 1680 
gctaattgta gtcattttgg agaatattgc 1740 
tatggaagac ctaaaagata tgctgggctt 1800 
gaaatatatt tctcctctaa tgctattatc 1860 
attaagtcct cctggctata acgcatggat 1920 
ctatccaaca tggggactgg ttgtttgtgt 1980 
ccctgtagtt ttcattgttc gtcgcttcaa 2040 
atctgtgacc tataagagag gaagggtcct 2100 
tacaagcctc attcacggaa aaataccgag 2160 
tatttatcga aaacagagtg gatccccaac 2220 
aatagggtac ttgatggcag atattatgcc 2280 
ggaaaagtca gtgggtttta tttggttcat 2340 
gaagcattag gcttcactta tcagagggca 2400 
atcctaacag tatatgtcag ttcaacttga 2460 
gtgcagaaag gattgcagac aciatcttagg 2520 
tttttaagta cctttggtgt attttcaaat 2580 
tcagtttcta ataatttctg ggtttagtag 2640 
atttataagt tgccttcagg gtaacttcca 2700 

2709 



cctgagtagc tgagattaca gtgcttttat 60 
gaaaattggt tttgaatggg cagcagctat 120 
agccatgggc ttgttttttc tccttgaata 180 
tccacttatc agagcccata gaccttgtat 240 
atacagaaat ccacccagcg ttggtcccat 3 00 
aagacctgat acaagtccca atgtacttaa 3-60 
actgagaatt tccgggaaag ttggaattat 420 
taatatcagc accagcagcc agagctgact 480 
tcctggaagg ctttggggag ctccgggccg 540 
tgccaggttc cagctgagca gcggccgccc 600 
agtccaagga cgtggcttga agccgggagc 660 
gaggcgctgg gtgacctgga gggaccacgc 720 
gcaggagaga cccccgggtg gctttcgaga 780 
tcggtgaact taggttccat gatgtgctat 840 
gctgaaaaga agggagccag caatacaatt 900 
ttcgagttgc tggcatcctt ggtatttgga 9 60 
atgtttgtag caggaatgtt tgtctcagga 1020 
cgagttccag atgggccagt atttattgct 1080 
gttagctttg ctgcagcaat gactgcatct 1140 
aacgtggcta cggtattggg aagtcttgag 1200 
cctcctgtag gtggcttttt gtatcaatcc 1260 
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tttggctatg aagtgccttt tattgttctg 
aatatgtata ttttacccaa ttacgagtct 
atcgctttac ccaaagttgg ccttatagcc 
ggcttcctcg atcctactct gtctctcttt 
tatgtgggac tagtattcct gggtatggca 
ggtctcctaa gtgataaaag gccacctcta 
atcacagccg ggtgctacat gctcttaggg 
ctctggctgc tggtgctgat attagttgta 
ccaactttcc cggaaattct cagttgtgca 
acattgggac ttgtatcagg tctttttagt 
ccaacgctgg gtggatttct gtatgagaaa 
ggtctatggg ctctgataag tggattagcc 
aggagaaaaa ggtctaaatc tcaaaacatc 
ttgcctaatg aaacctagtc cgatggatcc 
tcctggcctt aaacatcacc gtaggaaggg 
gaccccgtgc cagtgtcttg gaagtgtcaa 
tacttactgt gatactgaaa agctgtcctg 
tgaaaggagt aattaaaaac aagcciaaaca 
ttgtccttaa agatgttgtt attaactcga 
attctaagta cactgattct gtgaatgtac 
aatttgatat gctctaaata cataaaggtg 
atcagttttt aaaagatata ctttctcttt 
gaagaattct tgcctgccat acagaaactc 
aaaattctgt ttgtgtgaaa agtacaagaa 
cctacgttca cttatgatct ggatttataa 
tttaatgtct ctgttttttg gctctaccat 
ggtagatgta tttcatccct agagcaggtc 
ttgttaggga agggcttcct ccaacttcgt 
tctattgtgt aaaaaaeia 

<210> 10 
<211> 2135 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<223> IncytG ID No: 71556695CB1 

<400> 10 

ggaaaagggg cttggctccc tctggaccct 
tgcaggcagt gtgaccgttg tgctgttatt 
cccaaaaccc cagccttaag agcaagtcca 
ccccgatgtg tgcagaatct gcagatgggg 
acaggcctcc aagtacccaa gaaagcaagg 
tgaaagattt ccacaaggac atcctgaagc 
ctgaggacga ggctgaggga aaacctccgc 
ttgtgctctc tgtggctggc ggcttcgtgg 
tctgctacaa gaatggtgga ggtgcgtttc 
gcggcctgcc tgtgtttttc ttggagatca 
tcacctgctg ggaaaagatc tgccccttgt 
ttgtgtccct cctgaatgtc tactacatcg 
tccagtcctt ccagaaggag ctgccctggg 
actgcatgga ggacaccatg cgcaagaaca 
acttcacctc ccctgtcatc gagttctggg 
tcgaccaccc aggctctctg aaatgggacc 
tgtgtttctt ctgcatctgg aagggcgtca 
ccacttttcc attcgccatg ctcctggtgc 
cgggcgcagg catcaagttc tatctgtatc 
tgtggattga cgctgggact cagatattct 
cctcgctggg gagctacaac aagtacaagt 
gatgcctgaa cagtggtacc agttttgtgt 
tcatggcaca agagcaaggg gtggacattg 
ccttcattgc ctacccaaaa gctgtgacaa 
ttttttttat tatgcttctc ttgcttggac 
agatcacatc cttggttgat ctttacccat 
tcttcatcgc cttcgtgtgt agcatcagct 



ggatgcgtcg ttttgctgat ggtaccactc 1320 
gatccaggtg aacactcatt ctggaaactg 1380 
ttcgtcatca actcactcag ctcgtgtttt 1440 
gttttggaga agttcaattt accagctgga 1500 
ctgtcctatg ccatctcttc accactattt 1560 
aggaaatggc ttctggtgtt tggcaactta 1620 
cctgtcccaa tcttgcatat taaaagtcag 1680 
agtggcctct ctgctggaat gagtataatt 1740 
catgaaaatg ggtttgaaga gggattaagt 1800 
gcaatggggt caattggtgc ttttatggga 1860 
attggttttg aatgggcagc agctatacaa 1920 
atgggcttgt tttatctact ggagtattca 1980 
ctcagcacag aggaggaacg aactactctc 2040 
tggattgata caaggttgag aaatgaatgc 2100 
tttttaaaat tttacgcgca aaactccgtg 2160 
cgtgtttttg gatgatcctg tattgggctg 2220 
ctgaagcagc tatatttgaa atattaagta 2280 
aaacaagact tagtttttaa atgaccaaac 2340 
gttagttctt atttcctctg tttatttttt 2400 
cttttttatt aacagggaaa gaaatgaatt 2460 
cttcaaaata tgtagaaaca ttactatgaa 2520 
gtcctgaggt ttttcggtct tgttcaaaag 2580 
tctagcactc cctgacctta agcttttcta 2640 
taacaatact tacaacttcc atttttgtaa 2700 
acattacttg gtataacgtt tttcatttcc 2760 
ctgttttgtt tttgttttta tctatatctt 2820 
agcctccttc ccctaatgcg aatgcttgtt 2880 
gtgaaattgt gatgttgaag tgaataaatg 2940 

2958 



tcacccagcc cctgcaccct ggagcagagc 60 
ggcctaaaca catagcaaca ggaaaacaac 120 
acagtgcgcg actccccagg tcaggagaag 180 
ctgctctcgg ccctgcctct ggctgtggaa 240 
agatggccac caaggagaag ctgcagtgtc 300 
cctcaccagg gaagagccca ggcacgcggc 360 
agagggagaa gtggtctagc aagatcgact 420 
gcttgggcaa cgtctggcgc ttcccgtacc 480 
tcataccgta ttttattttc ctgtttggga 540 
tcataggcca gtacacctct gaagggggca 600 
tctctggtat cggctatgcc tccgttgtaa 660 
tcatcctggc ctgggccaca tactacctgt 720 
cacactgcaa ccacagctgg aacacacctc 780 
agagtgtctg gatcaccatc agctccacca 840 
agcgcaacgt gctgagcttg tcccctggaa 900 
tcgctctctg ccttctttta gtctggctag 960 
ggtccactgg gaaggtcgtc tacttcacag 1020 
tgctggtccg agggctgacg ctgccgggcg 1080 
ctgacatcac ccgccttgag gacccacagg 1140 
tctcttatgc catctgcctg ggggctatga 1200 
ataactcgta cagggactgt atgctgctgg 1260 
ctggcttcgc aattttttcc atcctgggct 1320 
ctgatgtggc tgagtcaggt cctggcctgg 1380 
tgatgccgct gcccacattt tggtccattc 1440 
tggatagcca gtttgttgaa gttgaaggac 1500 
ccttcctaag gaagggttat cgtcgggaaa 1560 
acctgctggg gctgacgatg gtgacggagg 1620 
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gtggcatgta tgtgtttcag 
gggttgcatt ctttgaatgt 
atggtattga ggacatgatt 
tgatcactcc agttctctgt 
tgacctacaa caaaacatac 
ccctttcctc catgctctgc 
ggccgttcct tgtgagagtc 
tggagcgcga gggagccaca 
aaccgaccca catcattgtg 

<210> 11 
<211> 1997 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 7472800CB1 

<220> 

<221> unsure 
<222> 1810 

<223> a, t, c, g, or other 
<400> 11 

gagcgggagg ggaggcttcg cggaacgctc tcggcgccag gactcgcgtg caaagcccag 60 
gcccgggcgg ccagaccaag agggaagaag cacagaattc ctcaactccc agtgtgccca 120 
tgagtaagag caaatgctcc gtgggactca tgtcttccgt ggtggccccg gctaaggagc 180 
Gcaatgccgt gggcccgaag gaggtggagc tcatccttgt caaggagcag aacggagtgc 240 
agctcaccag ctccaccctc accaacccgc ggcagagccc cgtggaggcc caggatcggg 300 
agacctgggg caagaagatc gactttctcc tgtccgtcat tggctttgct gtggacctgg 360 
ccaacgtctg gcggttcccc tacctgtgct acaaaaatgg tggcggtgcc ttcctggtcc 420 
cctacctgct cttcatggtc attgctggga tgccactttt ctacatggag ctggccctcg 480 
gccagttcaa cagggaaggg gccgctggtg tctggaagat ctgccccata ctgsiaaggtg 540 
tgggcttcac ggtcatcctc atctcactgt atgtcggctt cttctacaac gtcatcatcg 600 
cctgggcgct gcactatctc ttctcctcct tcaccacgga gctcccctgg atccactgca 660 
acaactcctg gaacagcccc aactgctcgg atgcccatcc tggtgactcc agtggagaca 720 
gctcgggcct caacgacact tttgggacca cacctgctgc cgagtacttt gaacgtggcg 780 
tgctgcacct ccaccagagc catggcatcg acgacctggg gcctccgcgg tggcagctca 840 
cagcctgcct ggtgctggtc atcgtgctgc tctacttcag cctctggaag ggcgtgaaga 900 
cctcagggaa ggtggtatgg atcacagcca ccatgccata cgtggtcctc actgccctgc 960 
tcctgcgtgg ggtcaccctc cctggagcca tagacggcat cagagcatac ctgagcgttg 1020 
acttctaccg gctctgcgag gcgtcggttt ggattgacgc ggccacccag gtgtgcttct 1080 
ccctgggcgt ggggttcggg gtgctgatcg ccttctccag ctacaacaag ttcaccaaca 1140 
actgctacag ggacgcgatt gtcaccacct ccatcaactc cctgacgagc ttctcctccg 1200 
gcttcgtcgt cttctccttc ctggggtaca tggcacagaa gcacagtgtg cccatcgggg 1260 
acgtggccaa ggacggtgag cccctcctgc tgcacctggg cctgctccgt gtagcaccag 1320 
cgccgagctc tcagcaaacc tcagcctggg ccgtggtctt cttcatcatg ctgctcaccc 1380 
tgggtatcga cagcgccatg ggtggtatgg agtcagtgat caccgggctc atcgatgagt 1440 
tccagctgct gcacagacac cgtgagctct tcacgctctt catcgtcctg gcgaccttcc 1500 
tcctgtccct gttctgcgtc accaacggtg gcatctacgt cttcacgctc ctggaccatt 1560 
ttgcagccgg cacgttgatg gacactgatg ccacctcttc tccctcaggt gttgggcagt 1620 
tcagcgacga catccagcag atgaccgggc agcggcccag cctgtactgg cggctgtgct 1680 
ggaagctggt cagcccctgc tttctcctgt tcgtggtcgt ggtcagcatt gtgaccttca 1740 
gaccccccca ctacggagcc tacatcttcc ccgactgggc ceiacgcgctg ggctgggtca 1800 
tcgccacatn ctccatggcc atggtgccca tctatgcggc ctacaagttc tgcagcctgc 18 60 
ctgggtcctt tcgagagaaa ctggcctacg ccattgcacc cgagaaggac cgtgagctgg 1920 
tggacagagg ggaggtgcgc cagttcacgg tgaggtcgag gtccctgctg ggcctctctc 1980 
gggggaattc agaatga 1997 

<210> 12 

<211> 2774 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 



ctctttgact actatgcagc 
tttgttattg cctggatata 
ggctatcggc ccgggccctg 
gttggatgtt tcatcttctc 
gtgtacccca actgggccat 
gttcccttgg tcatcgtcat 
aagtacctgc tgaccccaag 
ccttacaact ctcgcaccgt 
gagaccatga tgtga 



tagcggtgta tgccttttgt 1680 
tggaggtgat aacctttatg 1740 
gatgaagtac agctgggctg 1800 
gctcgtcaag tacgtacccc 1860 
tgggctgggc tggagcctgg 1920 
ccgcctctgc cagactgagg 1980 
ggaacccaac cgctgggctg 2040 
catgaacggc gctctcgtga 2100 

2135 
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<223> Incyte ID No: 7475553CB1 
<220> 

<221> unsure 
<222> 2666 

<223> a, t, g, or other 

<400> 12 

cgagccggag ccgagcgcgc cgaggccggg 
aactcgctac agttcgtgtt cgcctgcatc 
cgattcccgt acctgtgcca gatgtacggc 
atgcttatcg tggagggaat gccgctcttg 
cggcagggca gcatcggcgc ctggaggacc 
gccagcgtgg tggtctcttt cttcctctcc 
ttctggtacc tcttccactc cttccaggat 
ggtaaccaca cgggctacga tgaggagtgc 
tacaggaaaa ccctcaatat ctcgccgfccc 
ccggcgctgt gcctcctcct ggcctggctg 
gagtccactg gcaaggtggt gtatttcacg 
tacctcatca ggggcctcac gctccacgga 
cccaagatag agcagctggc caaccccaag 
ttctcacttg gcctgggctt cggcagcctg 
aacaactgcc agaagcacgc catcatcgtg 
gccagcattg tcaccttctc catctatggc 
ttgaagaagg tgagtctgct gctgaccaac 
gccagcaacc tggagcaggt gaagggctac 
gaga tgt tec cgcaaatcaa aaactgcagc 
ggcactggcc tggcattcat cgtctacaca 
ctgtggtcgg tgctctactt cttcatgctg 
aacacagcgg ccatcctcac ccctctgaca 
aaggaggcca tctcaggtct ggtgtgcctt 
atggaggctg ggaactactg gtttgacata 
ctgctcatcg tgctggtgga gacgattgcc 
gaaagtgacc ttaaggccat gaccggccga 
gctggcgtaa gcccactgct gat tgt cage 
ctcacgggga ccctgaagta tcaagcctgg 
gattacccgg cctatgcact ggctgtcatc 
atccccctgg cggccctggg gacttttgtt 
cccgtggcct gagatgtggg cttcccagcc 
cggaaactcc tcggctgctt tttcaaatgc 
cctgagtgtc taaagaagat gaggaaggtg 
acaccctccc gtggtggctg ttcctccctg 
ggctatgaaa gccggtctca aagataactg 
agggcacatt gcaaactgaa atatgactat 
ttgtttttgg ctgaagagac accaaaatat 
ggagttttga agtgcctaag atgacctatt 
ttttgagttg tagtgaatat gcaatttctg 
tgacaagtca taaagaaaga agatctttgt 
ggtctctgac agtgggtcca ggaccagacc 
tgcaacccca aaccacatta gaagtctgtg 
gcattggaaa aggagtcagg agccactgtg 
cacggtggct cacgcctgta atcctagcac 
ggtcaggagt ttgaaggcag cctggncaaa 
aacaaattag ctggcattgg gggagggttg 
gacccagggg gggg 



gccatggaga aagcgcggcc gctgtgggcc 60 
tcgtacgccg tgggcctggg caacgtgtgg 120 
ggaggtagtt tcctggtccc ctacatcatc 180 
tacctggaac tggctgtggg gcagcgcatg 240 
atcagcccgt acctcagtgg tgtcggggtc 300 
atgtactaca acgtcatcaa cgcctgggcc 360 
cccctgccgt ggtctgtctg cccactgaat 420 
gagaaggcgt cctccacaca gtacttctgg 480 
ctccaggaga acgggggtgt gcagtgggag 540 
gtggtgtacc tgtgcatcct gcgtggcacc 600 
gcgtcactgc cctattgcgt gctcatcatc 660 
gccaccaatg gcctcatgta catgttcact 720 
gcctggatca atgcagccac ccagatcttc 780 
atcgccttcg ccagctacaa tgagccatcc 840 
tccctcatca acagcttcac ctccatattt 900 
ttcaaggcca ccttcaatta tgaaaactgc 960 
acttttgacc ttgaagatgg ctttttgaca 1020 
ctcgcatctg cctacccaag caaatacagc 1080 
ttggaatcgg agctagacac ggccgtccag 1140 
gaggccatta aaaacatgga ggtgtcccag 1200 
ctgatgctgg gcattgggag catgctgggg 1260 
gacagcaaga tcatctccag ccacctgccc 1320 
gtcaactgtg ccattggcat ggtgttcacg 1380 
ttcaacgact acgcggccac actgtccctg 1440 
gtgtgctacg tgtacgggct gaggagattt 1500 
gc tgt gage t ggtactggaa ggtgatgtgg 1560 
ctctttgtct tctacctgag cgactacatc 1620 
gacgcctccc agggccagct cgtgaccaaa 1680 
gggctgcttg tggcctcctc caccatgtgc 1740 
cagcgtcgcc tcaagagggg agacgcagac 1800 
gctcacggtt ttacagatac tatttacagg 1860 
ttaagccagg agtgctcagc ccatcaactt 1920 
tgcaggaaga aaactccctt gggagaacgc 1980 
tcacctgcct cctcatcatg gaagggggtg 2040 
catccttcat tccaggaaag ccctagaatt 210 0 
aattcttatg ggaccaaatt taagcaattt 2160 
tagaggacaa atatttttag atccatttaa 2220 
tgtcagtggt gcaaaattaa ttctcttctt 2280 
tgttcccctt ccacGcttta aatcttagga 2340 
ctgggacccc caaagggatc ctttctctaa 2400 
tctctacaaa aaattgcccc aactacagtt 2460 
cagacatccc tccgtggtgt gtgtcttggt 2520 
aggtgagaat gaaagtggat ctcagctggg 2580 
cttgggggtc aaggtgggtg gatcacttga 2640 
atggcgaaaa cccatttcta ctaaaaatcc 2700 
gaggaatcga attaaagtta tggaaagggg 2760 

2774 



14/14 



